














































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This exam is for individuals looking to become certified Collibra Solution Architects. It focuses on data governance solutions, architecture, implementation strategies, and how to leverage Collibra’s data management platform to build efficient and scalable solutions.
Typology: Exams
1 / 86
This page cannot be seen from the preview
Don't miss anything!















































































Question 1. Which Collibra capability primarily enables users to discover and understand data assets across the enterprise? A) Data Quality B) Data Catalog C) Reference Data Management D) Policy Management Answer: B Explanation: The Data Catalog provides searchable metadata, lineage, and business context, allowing users to locate and understand data assets. Question 2. In a Data Governance operating model, which role is chiefly responsible for defining data definitions and ensuring they remain current? A) Data Owner B) Data Steward C) Data Custodian D) Data Architect Answer: B Explanation: Data Stewards maintain business glossaries, resolve data issues, and keep definitions accurate. Question 3. Which of the following is a key driver for implementing a Data Governance program? A) Reducing IT staff headcount B) Enhancing data-driven decision making C) Eliminating all data duplication D) Automating all business processes Answer: B
Explanation: Data Governance improves data quality, trust, and accessibility, leading to better decisions. Question 4. In the RACI matrix, the “C” stands for: A) Responsible B) Accountable C) Consulted D) Informed Answer: C Explanation: “C” denotes stakeholders who are consulted for input before decisions are made. Question 5. Which Collibra module helps organizations meet GDPR requirements by tracking data lineage and retention policies? A) Data Catalog B) Data Quality C) Data Governance Center (DGC) D) Reference Data Management Answer: C Explanation: The DGC stores governance assets, lineage, and policy metadata needed for GDPR compliance. Question 6. A single‑node Collibra deployment is most appropriate for: A) Enterprise‑wide production use with high availability B) Small pilot or proof‑of‑concept environments C) Multi‑region disaster recovery D) Load‑balanced horizontal scaling Answer: B
Explanation: An Asset is any governed item, including data elements, tables, or policies. Question 10. Extending the out‑of‑the‑box metamodel should always be done: A) By deleting default attributes to simplify the model B) Using custom attributes while preserving core structures for compatibility C) By creating a new database schema D) Only after the platform reaches version 10 Answer: B Explanation: Adding custom attributes maintains compatibility and leverages built‑in functionality. Question 11. Which Collibra feature provides role‑based dashboards for executives to monitor governance KPIs? A) Views B) Reports C) Dashboards D) Workflows Answer: C Explanation: Dashboards aggregate visual widgets and are configurable per role. Question 12. A common factor that degrades Collibra search performance is: A) Excessive use of custom Java code B) Large, fragmented search indices C. Low network latency D. Single‑node deployment Answer: B
Explanation: Fragmented or oversized indices slow query response; regular index maintenance is required. Question 13. In workflow design, the “Start Event” is responsible for: A) Defining the workflow’s final state B) Triggering the workflow execution based on a condition or user action C. Assigning permissions to the workflow owner D. Archiving completed tasks Answer: B Explanation: The Start Event initiates the workflow when its trigger condition is satisfied. Question 14. Which BPMN element represents a decision point that routes execution to different paths? A) Task B) Gateway C) Event D) Sub‑process Answer: B Explanation: Gateways evaluate conditions to determine which outgoing flow to follow. Question 15. When scripting inside a Collibra workflow, which language is natively supported for advanced logic? A) Python B) Groovy C) Ruby D) C# Answer: B
Explanation: RDM handles reference data such as country codes, product classifications, and their relationships. Question 19. Which Collibra asset type is best suited for documenting data retention policies? A. Business Term B. Policy C. Data Set D. Relation Answer: B Explanation: Policies capture rules, standards, and retention requirements. Question 20. When linking a Business Term to a technical Data Asset, the relationship type used is typically: A. “Is Part Of” B. “Is Synonym Of” C. “Is Described By” D. “Is Related To” Answer: C Explanation: “Is Described By” connects business terminology to the underlying technical asset. Question 21. The Collibra REST API is limited to: A. Only read‑only operations B. Bulk loading of more than 10,000 records per call C. 2,000‑record pagination per request D. Direct database manipulation Answer: C
Explanation: The API paginates results, typically returning up to 2,000 items per page. Question 22. Collibra Connect is primarily used for: A. Visualizing data lineage graphs B. Designing low‑code integration flows with external systems C. Managing user passwords D. Running internal batch jobs Answer: B Explanation: Collibra Connect provides a low‑code platform to orchestrate data movement and metadata synchronization. Question 23. Which of the following data sources can be automatically scanned by Collibra’s Data Catalog out‑of‑the‑box? A. Amazon S3 buckets B. Legacy mainframe flat files only C. Oracle, SQL Server, and Snowflake databases D. On‑premise Excel spreadsheets stored locally Answer: C Explanation: Collibra includes connectors for major relational and cloud data warehouses. Question 24. Data lineage in Collibra is visualized using: A. Sankey diagrams only B. Directed acyclic graphs showing source‑to‑target flows C. Tabular reports without graphics D. Heat maps of data volume Answer: B Explanation: Lineage graphs depict the flow of data through transformations and processes.
Explanation: Processing intensive workflows require adequate CPU and RAM to avoid bottlenecks. Question 28. A “Community” in Collibra is used to: A. Define a set of global permissions B. Group users with similar interests and provide a shared workspace for assets C. Store backup files D. Configure network routing rules Answer: B Explanation: Communities act as logical spaces where related assets and users collaborate. Question 29. Which of the following is a recommended practice for maintaining index health in Collibra? A. Disable indexing for all custom attributes B. Schedule regular re‑indexing during low‑traffic windows C. Increase the size of the index file to 10 GB arbitrarily D. Delete all old workflow logs weekly Answer: B Explanation: Regular re‑indexing removes fragmentation and improves search performance. Question 30. When configuring a Data Quality rule in Collibra, the “Severity” field determines: A. The visual color of the rule in the UI B. The priority for remediation workflows C. The number of users who can view the rule D. The storage location of the rule definition Answer: B
Explanation: Severity drives escalation paths and influences workflow routing for issues. Question 31. Which of the following best describes the purpose of a “Status” attribute on an asset? A. To record the physical location of the data file B. To indicate the governance lifecycle stage (e.g., Draft, Approved) C. To store the asset’s checksum value D. To define the asset’s data type Answer: B Explanation: Status captures where an asset is in its approval or lifecycle process. Question 32. In Collibra, “Relations” are used to: A. Store binary data files B. Define connections between two assets, such as “Is Parent Of” C. Schedule automated backups D. Configure user authentication methods Answer: B Explanation: Relations model the semantic links between assets. Question 33. Which of the following is a key benefit of implementing a Data Governance Council? A. Automating all data transformations B. Providing cross‑functional oversight and decision‑making authority C. Eliminating the need for data stewards D. Reducing data storage costs automatically Answer: B
D. Email attachment processing Answer: B Explanation: The REST API enables programmatic posting of DQ metrics into Collibra assets. Question 37. Which environment typically contains production data and must have strict change‑control procedures? A. DEV B. QA C. PROD D. UAT Answer: C Explanation: Production (PROD) holds live data and requires formal governance for any changes. Question 38. A recommended approach for backing up Collibra data is to: A. Export the entire UI as PDF weekly B. Use database snapshots combined with file system backups of configuration files C. Rely solely on cloud provider automatic backups D. Copy the application JAR files daily Answer: B Explanation: Database snapshots protect metadata, while configuration file backups preserve customizations. Question 39. During a major version upgrade, the best practice for minimizing downtime is to: A. Upgrade directly on the production node without testing B. Perform a dry‑run in a sandbox environment, then schedule a maintenance window for the cutover
C. Skip applying patches to reduce time D. Upgrade only the UI components Answer: B Explanation: Testing in a non‑production environment identifies issues before the scheduled upgrade. Question 40. Which monitoring metric is most indicative of a potential indexing problem in Collibra? A. CPU usage under 10% B. Search query latency consistently above 5 seconds C. Disk space usage below 20% D. Number of active user sessions Answer: B Explanation: High search latency often signals fragmented or overloaded indexes. Question 41. To encourage user adoption, a Collibra implementation should first provide: A. Complex custom workflows for all users B. Role‑specific training and quick‑win use cases C. Mandatory daily data entry tasks D. Unlimited custom attribute creation for every user Answer: B Explanation: Tailored training and early successes drive engagement and confidence. Question 42. Which of the following statements about Collibra’s “Views” is true? A. Views are static PDF reports only B. Views are reusable query definitions that can be embedded in dashboards C. Views replace the need for user permissions
D. Generating PDF reports Answer: B Explanation: Connectors are the integration points that move data in and out of Collibra. Question 46. Which authentication method allows Collibra to delegate user verification to an Active Directory server? A. LDAP B. SAML C. OAuth D. JWT Answer: A Explanation: LDAP queries AD for user credentials and group memberships. Question 47. A “Workflow Transition” in Collibra is used to: A. Change the asset’s visual icon B. Move a workflow instance from one state to another based on conditions or user actions C. Export workflow definitions as XML D. Reset the database schema version Answer: B Explanation: Transitions define the movement between workflow steps. Question 48. Which of the following best describes a “Data Stewardship” activity? A. Writing application code for ETL jobs B. Approving a new data set definition and ensuring its quality C. Configuring network firewalls D. Managing virtual machine snapshots
Answer: B Explanation: Data stewards review, approve, and monitor data assets for quality and compliance. Question 49. In Collibra, the “Reference Data Set” asset type is primarily used for: A. Storing raw transactional logs B. Managing master lists of codes and their hierarchies C. Capturing user login history D. Defining UI layout templates Answer: B Explanation: Reference Data Sets hold controlled vocabularies and code tables. Question 50. Which of the following is a common reason for workflow failures in Collibra? A. Excessive use of uppercase letters in asset names B. Missing required attribute values causing script errors C. Having more than one community defined D. Using an IPv6 address for the database server Answer: B Explanation: Workflows validate required fields; missing data can cause script or validation errors. Question 51. The primary purpose of a “Data Quality Dashboard” is to: A. Visualize DQ metrics, trends, and issue counts for stakeholders B. Store raw data files for analysis C. Manage user access rights D. Schedule system backups Answer: A
Explanation: Grouping keeps the model organized and improves performance. Question 55. In Collibra, a “Community” can contain its own: A. Database instance B. Set of assets, dashboards, and workflows isolated from other communities C. Operating system user accounts D. Network subnet Answer: B Explanation: Communities provide logical partitions for assets and collaboration. Question 56. Which type of Collibra deployment is recommended for a multi‑region enterprise requiring high availability? A. Single‑node on‑premise B. Clustered deployment across data centers with load balancers C. Desktop installation for each analyst D. Virtual machine with only one CPU core Answer: B Explanation: Clustering distributes load and provides failover across regions. Question 57. The “Data Catalog” connector for Snowflake primarily extracts: A. User passwords B. Table schemas, columns, and view definitions for metadata ingestion C. Network latency statistics D. Application logs Answer: B Explanation: The connector reads Snowflake’s information schema to import technical metadata.
Question 58. Which Collibra component is responsible for storing the workflow engine’s runtime data? A. Application layer B. Service layer C. Database layer D. Integration layer Answer: C Explanation: Workflow instances, tasks, and histories are persisted in the database. Question 59. When implementing GDPR “right to be forgotten,” the Collibra process should: A. Delete the entire Collibra instance B. Locate personal data assets, trigger a deletion workflow, and update retention policies accordingly C. Archive all data without deletion D. Disable user login only Answer: B Explanation: A controlled workflow ensures personal data is identified, removed, and documented. Question 60. Which of the following describes a “Data Stewardship Dashboard”? A. A UI page that shows system health metrics only B. A dashboard that lists assigned data quality issues, pending approvals, and stewardship tasks C. A report of network bandwidth usage D. A template for writing Java code Answer: B