
















































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A broad-coverage exam that tests foundational and intermediate knowledge of the entire Vantage ecosystem — architecture, administration, analytics, SQL functions, data engineering workflows, and cloud capabilities. Ideal for learners seeking platform proficiency across multiple domains.
Typology: Exams
1 / 88
This page cannot be seen from the preview
Don't miss anything!

















































































Question 1. Which statement best describes a relational database? A) Stores data only in hierarchical trees B) Uses tables with rows and columns and enforces relationships via keys C) Stores data as flat files without any schema D) Requires all data to be in JSON format Answer: B Explanation: A relational database organizes data into tables (relations) with rows (records) and columns (attributes) and defines relationships using primary and foreign keys. Question 2. In Teradata, what is the primary difference between a Primary Index (PI) and a Primary Key (PK)? A) PI determines data distribution; PK enforces uniqueness only B) PK determines data distribution; PI enforces uniqueness only C) Both are identical concepts in Teradata D) PI is used only for temporary tables, PK for permanent tables Answer: A Explanation: The Primary Index controls how rows are hashed and distributed across AMPs, while the Primary Key is a logical constraint ensuring uniqueness but does not affect distribution. Question 3. Which schema is characterized by a single fact table surrounded by denormalized dimension tables? A) Snowflake schema B) Star schema C) Third Normal Form (3NF) schema D) Galaxy schema Answer: B
Explanation: A star schema has a central fact table with flat, denormalized dimension tables, simplifying query performance. Question 4. When would you prefer a Snowflake schema over a Star schema? A) When query performance is the highest priority B) When you need to minimize storage by normalizing dimensions C) When the data model has no dimensions D) When you want to avoid any joins in queries Answer: B Explanation: Snowflake schemas normalize dimension tables, reducing redundancy and saving storage at the cost of additional joins. Question 5. Which of the following is a key benefit of the Teradata Vantage Cloud architecture? A) Fixed on‑premise hardware only B) Automatic scaling of compute and storage resources on demand C) Requires manual provisioning of each node D) No support for hybrid cloud deployments Answer: B Explanation: Vantage Cloud provides elastic scaling, allowing compute and storage to be adjusted automatically based on workload. Question 6. In Vantage, which object type is used to encapsulate reusable SQL logic that can be invoked like a table? A) Macro B) View C) Stored Procedure
C) It handles user authentication only D) It is the physical network fabric between nodes Answer: B Explanation: The Data Access Layer parses SQL, creates execution plans, and manages query execution. Question 10. Which type of index is most appropriate for accelerating join performance on non‑primary key columns? A) Primary Index B) Secondary Index (SI) C) Join Index (JI) D) Unique Primary Index (UPI) Answer: C Explanation: Join Indexes pre‑join tables on specified columns, improving join query performance. Question 11. What is data skew in the context of Teradata Primary Index distribution? A) Uniform distribution of rows across all AMPs B) Uneven distribution causing some AMPs to hold significantly more rows C) A security vulnerability in data encryption D) A feature that automatically balances load Answer: B Explanation: Data skew occurs when the hash of the PI causes an uneven row distribution, leading to performance bottlenecks. Question 12. Which partitioning method allows Teradata to eliminate partitions dynamically based on query predicates?
A) Range Partitioning B) Hash Partitioning C) Dynamic Partition Elimination (DPE) D) List Partitioning Answer: C Explanation: DPE enables Teradata to skip reading partitions that are not needed for the query, improving performance. Question 13. Which locking level in Teradata is the most granular? A) Database level B) Table level C) Row level D) Transaction level Answer: C Explanation: Row‑level locks lock individual rows, providing the finest granularity. Question 14. Which data type would you use to store semi‑structured JSON documents in Vantage? A) VARCHAR B) BLOB C) JSON D) CLOB Answer: C Explanation: Teradata provides a native JSON data type for storing and querying JSON documents. Question 15. Which of the following is a valid use of the PERIOD data type?
B) SUM() OVER (ORDER BY date) C) COUNT(*) D) MAX() OVER (PARTITION BY col) Answer: B Explanation: SUM() used as a window function with ORDER BY produces a cumulative total. Question 19. What is the purpose of the EXTENDED GROUP BY operator (e.g., GROUP BY ROLLUP)? A) To enforce row‑level security B) To generate hierarchical subtotals and grand totals in a single query C) To compress data during aggregation D) To create indexes automatically Answer: B Explanation: ROLLUP and CUBE produce grouped subtotals and grand totals for multi‑dimensional analysis. Question 20. During database design, which activity helps identify the appropriate data domains for columns? A) Data profiling B) Index rebuilding C) Query rewriting D) Log shipping Answer: A Explanation: Data profiling examines source data to determine value ranges, patterns, and appropriate data domains.
Question 21. Which statistic collection method provides the optimizer with the most accurate row‑count estimates? A) Default statistics (no collection) B) COLLECT STATISTICS on all columns used in predicates and joins C) Manual estimation by the DBA D) Using only primary key statistics Answer: B Explanation: Collecting statistics on columns used in predicates and joins gives the optimizer accurate cardinality estimates. Question 22. In an EXPLAIN plan, what does a “Hash Join” indicate? A) The join is performed using nested loops B) The optimizer will build a hash table on the smaller input to probe with the larger input C) The join is executed on the client side D) The join will be performed after all aggregations are complete Answer: B Explanation: A Hash Join builds a hash table from one input (typically the smaller) and probes it with the other input. Question 23. Which type of Referential Integrity (RI) constraint allows NULL values in the foreign key column? A) NOT NULL RI B) Cascading Delete RI C) Optional RI (foreign key can be NULL) D) Enforced RI without NULLs Answer: C
D) They cannot be parallelized across AMPs Answer: C Explanation: Access Modules extract data row‑wise, which may be less efficient for columnar workloads. Question 27. Which step is essential before moving data from on‑premise Teradata to Amazon S3? A) Enabling TASM throttling B) Configuring an external stage with appropriate IAM credentials C) Creating a Viewpoint dashboard D) Defining a Join Index on S3 objects Answer: B Explanation: An external stage defines the S3 bucket and IAM credentials needed for secure data transfer. Question 28. QueryGrid in Vantage primarily enables which capability? A) In‑memory caching of query results B) Federated queries across heterogeneous data sources (e.g., Hadoop, Oracle) C) Automatic index creation D) Real‑time data streaming into Vantage Answer: B Explanation: QueryGrid allows Vantage to execute queries that join data residing in external systems. Question 29. Which authentication method is considered the most secure for Vantage users? A) Password only
B) Kerberos with ticket granting C) LDAP simple bind D) Shared secret in clear text Answer: B Explanation: Kerberos provides ticket‑based authentication, reducing password exposure and supporting single sign‑on. Question 30. What is the purpose of a “proxy user” in Teradata security? A) To store encrypted passwords B) To act on behalf of another user, inheriting their privileges C) To manage database backups D) To create temporary tables automatically Answer: B Explanation: Proxy users allow one account to execute statements using another user’s privileges, facilitating service accounts. Question 31. Which Viewpoint portlet helps administrators monitor spool space consumption? A) System Health B) Space Utilization C) Query Performance D) User Activity Answer: B Explanation: The Space Utilization portlet displays perm, temp, and spool space usage per user/database. Question 32. Which metric indicates that a query is experiencing blocking in Vantage?
Question 35. In the context of analytics, which function is used to calculate the sentiment score of a text column? A) TEXT_ANALYZE() B) SENTIMENT() C) REGEXP_COUNT() D) CHAR_LENGTH() Answer: B Explanation: The SENTIMENT() function (available via ClearScape) evaluates textual data to produce a sentiment score. Question 36. Which language is NOT natively supported for in‑database analytics in ClearScape? A) Python B) R C) JavaScript D) SAS Answer: C Explanation: ClearScape provides native support for Python, R, and SAS, but not JavaScript. Question 37. ModelOps in Vantage primarily addresses which phase of the machine‑learning lifecycle? A) Data ingestion only B) Model training, deployment, and monitoring C) Data visualization only D) Index creation for model tables Answer: B
Explanation: ModelOps provides tools for deploying, managing, and monitoring models in production. Question 38. When operationalizing a predictive model pipeline, which step should occur first? A) Model scoring in production B) Model validation against hold‑out data C) Model deployment to Vantage D) Model retirement Answer: B Explanation: Validation ensures the model performs adequately before it is deployed. Question 39. Which visualization type is best suited for showing the distribution of a single numeric variable? A) Bar chart B) Line chart C) Histogram D) Pie chart Answer: C Explanation: Histograms display frequency distribution across numeric intervals. Question 40. In statistical analysis, what does a p‑value less than 0.05 typically indicate? A) The null hypothesis is definitely true B) There is strong evidence against the null hypothesis C) The data is normally distributed D) The sample size is too small Answer: B
Answer: B Explanation: EXPORT writes data from Teradata to external files. Question 44. When integrating with Azure Blob storage, which authentication method is commonly used? A) SSH keys B) Shared Access Signature (SAS) token C) Kerberos tickets D) LDAP bind DN Answer: B Explanation: SAS tokens grant time‑limited access to Azure Blob containers for secure data movement. Question 45. Which of the following best describes “Data Mesh” architecture? A) Centralized data lake with single governance B) Distributed domain‑oriented ownership of data with self‑serve infrastructure C) Traditional ETL pipeline to a relational warehouse D) In‑memory analytics on a single node Answer: B Explanation: Data Mesh promotes decentralized data ownership, treating data as a product with domain teams responsible for it. Question 46. In Teradata, what does the “ROW HASH” function compute? A) The checksum of a row for data integrity verification B) The hash value used for Primary Index distribution C) The encryption key for column-level security
D) The row number in a result set Answer: B Explanation: ROW HASH calculates the hash of the Primary Index columns to determine the target AMP. Question 47. Which lock mode allows a transaction to read a row while another transaction holds a write lock on the same row? A) Exclusive lock (X) B) Shared lock (S) C) Read‑Committed lock (RC) D) No lock (NL) Answer: B Explanation: Shared locks permit concurrent reads; exclusive locks block other accesses. Question 48. What is the purpose of the “ROW LEVEL SECURITY” (RLS) feature? A) To encrypt rows at rest B) To restrict row visibility based on user predicates C) To compress rows automatically D) To distribute rows across AMPs evenly Answer: B Explanation: RLS applies row‑filtering predicates per user, ensuring users see only authorized rows. Question 49. Which SQL clause is used to remove duplicate rows from a result set? A) DISTINCT B) GROUP BY C) HAVING
Answer: B Explanation: RPI hashes the entire row to determine distribution, useful for tables without a natural PI. Question 53. In Vantage, what is the default isolation level for a session? A) READ UNCOMMITTED B) READ COMMITTED C) REPEATABLE READ D) SERIALIZABLE Answer: B Explanation: The default isolation level is READ COMMITTED, allowing non‑repeatable reads but preventing dirty reads. Question 54. Which Teradata utility is optimized for high‑speed bulk loading of data into empty tables? A) FastExport B) FastLoad C) MultiLoad D) TPT Stream Answer: B Explanation: FastLoad is designed for fast loading into empty target tables. Question 55. What is the main advantage of using a “Secondary Index” (SI) on a frequently filtered column? A) It reduces data skew on the Primary Index B) It eliminates the need for a Primary Index C) It allows the optimizer to locate rows without scanning the entire table
D) It automatically partitions the table by the indexed column Answer: C Explanation: An SI provides an alternate path to locate rows based on the indexed column, improving query performance. Question 56. Which of the following best describes “Query Rewrite” in Vantage? A) Automatic transformation of a query into a more efficient form using materialized views or aggregates B) Manual editing of SQL text by the user C) Changing the query language from SQL to Python D) Encrypting the query before execution Answer: A Explanation: Query Rewrite uses metadata (e.g., materialized views) to replace parts of a query with pre‑computed results. Question 57. In a multi‑tenant Vantage environment, which feature helps isolate resource usage per tenant? A) TASM resource pools and throttles B) Primary Index selection C) Row‑level security only D) Default user passwords Answer: A Explanation: TASM allows defining resource pools, throttles, and classification to isolate tenant workloads. Question 58. Which command would you use to grant SELECT privilege on a table to a role? A) GRANT SELECT ON TABLE mydb.mytable TO ROLE analytics_role;