










Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of the caboodle console, a web application that serves as the interface for a data warehouse system. It covers key concepts such as the caboodle architecture, which includes a staging database and a reporting database, the use of ssis packages for data extraction and transformation, the data model structure based on a star schema, and the enforcement of strict naming conventions. The document also discusses data lineage, referential integrity, and the handling of type 1 and type 2 data changes. Additionally, it covers topics related to fact tables, dimension tables, bridge tables, and attribute-value dimension tables. The information presented in this document would be useful for understanding the design and functionality of the caboodle data warehouse system, which is commonly used in healthcare and other industries.
Typology: Exams
1 / 18
This page cannot be seen from the preview
Don't miss anything!











Quiz :CHAPTER 1-Caboodle Console - Answer :The Caboodle Console is a web application housed on the Caboodle server. It includes the following: Dictionary Dictionary Editor Executions Work Queue Configuration Quiz :CHAPTER 1-Data Warehouse - Answer :In a data warehouse, multiple sources may load data pertaining to a single entity. This means that more than one package may populate a given row in a Caboodle table. As a result, there may be multiple business key values associated with a single entity in a Caboodle table. Quiz :CHAPTER 1-ETL - Answer :Extract, Transform, Load Quiz :CHAPTER 1-SSIS Package - Answer :The architecture of Caboodle includes a staging database and a reporting database. Data is extracted from source systems (like Clarity), transformed in the staging database, and presented for users in the reporting database. This movement of data is realized via a set of SQL Server Integration Services (SSIS) packages. Quiz :CHAPTER 1-Data Lineage - Answer :Generally, data lineage refers to the process of identifying the source of a specific piece of information. In Caboodle, data lineage is defined at the package level. Quiz :CHAPTER 1-Star Schema - Answer :The standard schema for a dimensional data model. The name refers to the image of a fact table surrounded by many linked dimension tables, which loosely resembles a star. The Caboodle data model structure is based on a "star schema" ‐ where one central fact table will join to many associated lookup or dimension tables. This structure provides the foundation of the Caboodle data model. Quiz :CHAPTER 1-DMC - Answer :DATA MODEL COMPONENT
No table in Caboodle "stands alone." Each is considered part of a Data Model Component, which refers to the collection of metadata tables that support the ETL process and reporting views stored in the FullAccess schema. Each DMC gets a type. Strict table naming conventions are followed in Caboodle, so that a table's suffix provides information about its structure and purpose. These suffixes are: · Dim for dimensions (e.g. PatientDim) · Fact for facts (e.g. EncounterFact) · Bridge for bridges (e.g. DiagnosisBridge) · DataMart for data marts (e.g. HospitalReadmissionDataMart) · AttributeValueDim for EAV tables (e.g. PatientAttributeValueDim) · X for custom tables (e.g. CustomFactX) Quiz :CHAPTER 1-Staging Database - Answer :The Caboodle database into which records are loaded by SSIS packages and stored procedures. Quiz :CHAPTER 1-Reporting Database - Answer :The architecture of Caboodle includes a staging database and a reporting database. Data is extracted from source systems (like Clarity), transformed in the staging database, and presented for users in the reporting database. This movement of data is realized via a set of SQL Server Integration Services (SSIS) packages. Quiz :CHAPTER 1-Dbo Schema - Answer :STAGING DATABASE Import tables and Mapping tables live here. This is primarily used by administrators for moving data into Caboodle. REPORTING DATABASE The dbo schema stores reporting data and acts as the data source for SlicerDicer. The Caboodle Dictionary reflects the contents of the dbo schema. Quiz :CHAPTER 1-FullAccess Schema - Answer :STAGING DATABASE The FullAccess schema does not exist on the Staging database. REPORTING DATABASE The FullAccess schema houses views that simplify reporting. FullAccess should be your default schema when reporting.
A. Users B. ED Visits C. Diagnosis definitions - Answer :A. Users B. ED Visits Users and diagnosis definitions would be in their own dimension tables. ED Visits are considered reportable, measureable events and would be in a fact table. Quiz :CHAPTER 2-Which of the following are possible in Caboodle? For each of the answers you select, use the Caboodle Dictionary to find an example. A. Joining a Fact table to another Fact table B. Joining a Fact table to a Dimension table C. Joining a Dimension table to a Dimension table D. Joining two different Fact tables to the same Profile Dimension table - Answer :(Examples may vary) a. EncounterFact can join to HospitalAdmissionFact b. MedicationOrderFact can join to PatientDim c. PatientDim can join to ProviderDim Quiz :CHAPTER 2-Which of the following can be populated by more than one package? Select ALL that apply. A. A single table B. A single row in a table - Answer :A & B Quiz :CHAPTER 2-TRUE or FALSE: Caboodle and Clarity can be used interchangeably. - Answer :FALSE Quiz :CHAPTER 2-Fact Table - Answer :The core table types in a dimensional data model are facts and dimensions. A row in a fact table corresponds to the occurrence of some significant, measurable event, such as the ordering of a medication or the posting of a charge transaction. A fact table typically contains lookup columns to several dimension tables. This arrangement somewhat resembles a star and so is referred to as a "star schema Fact tables typically join to dimension tables, but may also join to other types of DMCs, such as other fact tables and bridge tables.
All fact tables have a Count column, which stores 0 for the negative rows, 0 for any row that represents deleted data, and 1 otherwise. The names of fact tables end with "Fact." All Fact Tables in Caboodle use a surrogate key as their primary key. Quiz :CHAPTER 2-Dimension Table - Answer :The core table types in a dimensional data model are facts and dimensions. A row in a dimension table represents a concrete entity which provides context for an event, such as the patient for whom a medication was ordered. A fact table typically contains lookup columns to several dimension tables. This arrangement somewhat resembles a star and so is referred to as a "star schema." Fact tables typically join to dimension tables, but may also join to other types of DMCs, such as other fact tables and bridge tables. It is possible, but not common, for a dimension table to contain lookup columns to other DMCs. All Dimension Tables in Caboodle use a surrogate key as their primary key. Quiz :CHAPTER 2-Surrogate Key - Answer :A system‐generated identifier in Caboodle. Not inherently meaningful to report consumers. All fact and dimension tables in Caboodle use a surrogate key as their primary key. Surrogate key column names match the name of the table. Surrogate key values are created during the ETL process. Each surrogate key value uniquely identifies one row of the table. Surrogate key values are not the same as source identifiers. Quiz :CHAPTER 2-Source Identifier - Answer :Unique identifier in the source data. Consists of two columns in Caboodle: IdType and Id. Also known as a business key. During the ETL process, a load package must provide an IdType and Id, collectively known as the business key. The IdType is a label for the type of
Type 1 = Current Type 2 = Storing data over time Quiz :CHAPTER 3-Consider the following query: SELECT DurableKey FROM
Quiz :CHAPTER 3-Durable Key Lookup - Answer :Ends in DurableKey: joins to DurableKey column Quiz :CHAPTER 3-Type 2 Entity - Answer :Many tables in Caboodle reference patients, providers, and employees. PatientDim, ProviderDim, and EmployeeDim are Type 2 tables in Caboodle. Patients, providers, and employees are examples of Type 2 entities in Caboodle. When a fact table references a patient, it is referencing a Type 2 entity, and it has two ways to do so: by the primary key or by the durable key of PatientDim. Quiz :CHAPTER 1-What are the differences between a normalized and dimensional data model? - Answer :In a normalized data model, the focus is on not repeating data, which reduces the size of the database. In a dimensional data model, the focus is on ease of reporting and uses the star schema, which focuses on a central fact table pertaining to a reportable event and surrounding dimension tables providing context for the event. CONTEXT...It gives CONTEXT Quiz :CHAPTER 1-Briefly define the roles of the Caboodle report writer, administrator, and developer. - Answer :· The Caboodle report writer queries data that already exists in the database. They use their knowledge of the tools and the source database(s) to conduct research into the necessary data points required for a given report. · The Caboodle administrator uses the Caboodle Console to manage and monitor the ETL process. They troubleshoot ETL errors and handle configuration steps for the database. · The Caboodle developer designs custom DMCs (Data Model Components) and writes SSIS packages to bring additional data into the warehouse. This may be additional Epic data from Clarity or non‐Epic data from 3rd party sources. Quiz :CHAPTER 1-TRUE or FALSE: Naming conventions are enforced in Caboodle. - Answer :True. Naming conventions, such as
Quiz :CHAPTER 4-TRUE or FALSE: All inferred rows in Caboodle have a primary key of ‐1. - Answer :False Inferred rows in Caboodle have non‐negative primary key values because they represent an entity that Caboodle has inferred the existence of. The foreign key columns for such rows will be set to ‐1 until the Caboodle ETL process updates the data. Quiz :CHAPTER 4-Referential Integrity - Answer :Referential integrity is enforced in Caboodle, meaning that lookup columns will never contain null values. Furthermore, every value in a lookup column has a match in the destination table. Caboodle enforces referential integrity by:
Since many date columns in Caboodle are foreign keys to the DateDim table, NULL dates from Clarity are often represented with a ‐1 value DateKey columns in Caboodle. Quiz :CHAPTER 4-The - 2 Row - Answer :AS PART OF REFERENTIAL INTEGRITY When a foreign key is not applicable (N/A) as defined by the SSIS package, Caboodle sets the value to - 2. This is handled by the ETL process. Quiz :CHAPTER 4-The - 3 Row - Answer :AS PART OF REFERENTIAL INTEGRITY If an account was deleted in the source, all foreign key values are set to ‐3. When a record is hard‐deleted in the source system or no longer passes the filters required for extraction, Caboodle tracks the deletion in one of two ways. · For Type 1 DMCs, the existing row is updated with default values representing the deletion. The primary key remains unchanged. · For Type 2 DMCs, a new row is added that represents the deletion. Existing Type 2 data remains, while Type 1 data is overwritten with the default values. The primary key﴾s﴿ and durable key will remain unchanged. Quiz :CHAPTER 5-What two columns do all bridge tables have? - Answer :Every bridge table has a ComboKey to which other tables join and a Foreign Key used to join to its associated dimension table. Quiz :CHAPTER 5-TRUE of FALSE: AttributeValueDim tables are more granular than their associated fact table. - Answer :True The AttributeValueDim tables can store multiple attribute‐value combinations on separate rows for the same entity, making them more granular than their associated fact table Quiz :CHAPTER 5-Bridge Table - Answer :Bridge tables exist to capture many‐ to‐many relationships, such as the one between patients and the diagnoses on their problem list.
D. PharmacyKey stores the PHR record ID; PharmacyEpicId uniquely identifies a row in PharmacyDim - Answer :C. PharmacyKey uniquely identifies a row in PharmacyDim; PharmacyEpicId stores the PHR record ID. A user doesn't have access to an RW Report unless both the user and the report share a report group in common. Quiz :PRACTICE EXAM-How can you tell if a column is a foreign key in Caboodle? CHOOSE ALL THAT APPLY A. The column is marked as Type 2 B. The ER diagram lists it as a foreign key C. The column is contained in a fact table D. The destination table is listed to the right of the column description - Answer :B. The ER diagram lists it as a foreign key D. The destination table is listed to the right of the column description Type 1 tables do not track changes, so do not need a DurableKey or IsCurrent column. Quiz :PRACTICE EXAM-Tables in Caboodle are categorized as either Type 1 or Type 2. This best describes which of the following? CHOOSE ONLY ONE ANSWER A. Caboodle enforces referential integrity B. Caboodle can track changes in the source C. Caboodle uses the dimensional data model D. Caboodle uses the entity-attribute-value data model - Answer :B. Caboodle can track changes in the source. All Type 2 tables have special columns to help track these changes; DurableKey, StartDate, EndDate, and IsCurrent. Quiz :PRACTICE EXAM-Consider the following query: Which of the following logical expressions will return all hospital admissions where the admitting provider was the same as the discharging provider? SELECT EncounterEpicCsn, AdmittingProviderKey, DischargingProviderKey FROM HospitalAdmissionFact WHERE
Dimension SetDim DataMart AttributeValueDim Bridge - Answer :Bridge Although it is the destination of the join, the combo key column of a bridge table is not the primary key. Rather, multiple rows may have the same combo key value. Quiz :OVERALL-Granularity - Answer :The level of detail for one row in the table. Quiz :OVERALL-Primary Key - Answer :The Primary Key is the column(s) that can uniquely ID a row in a table. Quiz :OVERALL-Surrogate Key - Answer :Surrogate Key is a key where the values don't exist in the source data. These values are NOT in Clarity. They are generated in the ETL process...They are created when moved. Quiz :OVERALL-NAME COVENTIONS IN CABOODLE - Answer :Each DMC gets a type. Strict table naming conventions are followed in Caboodle, so that a table's suffix provides information about its structure and purpose. These suffixes are: · Dim for dimensions (e.g. PatientDim) · Fact for facts (e.g. EncounterFact) · Bridge for bridges (e.g. DiagnosisBridge) · DataMart for data marts (e.g.HospitalReadmissionDataMart) · AttributeValueDim for EAV tables (e.g. PatientAttributeValueDim) · X for custom tables .e.g. CustomFactX Quiz :OVERALL-ER Diagram - Answer :An entity‐relationship diagram, or ER diagram, is a visual tool that displays information about how database objects relevant to a particular reporting area join to one another. Caboodle ER diagrams are automatically generated for fact tables and data mart tables in the dictionary. Quiz :DEFINITION-SURROGATE KEY - Answer :A surrogate key is any column or set of columns that can be declared as the primary key instead of a "real" or natural key. Sometimes there can be several natural keys that could be
declared as the primary key, and these are all called candidate keys. So a surrogate is a candidate key. Quiz :OVERALL-CSN - Answer :Contact Serial Number Quiz :OVERALL-Type 1 - Answer :Type 1 tables and columns in Caboodle do not track historical data. When a Type 1 value is updated in the source, all instances of that value in Caboodle are updated the next time the data gets extracted. If a table is Type 1---THEN all of the columns in that table are Type 1. Quiz :OVERALL-Type 2 - Answer :Caboodle retains historical values by creating multiple rows for each entity, with each row containing the values for a particular time period. Typically, Type 2 tables contain a mix of Type 1 and Type 2 columns. A Type 2 table must contain at least one Type 2 column. KNOW THIS!! Type 2 tables typically have a mix of Type 1 and Type 2 columns. Only changes to Type 2 data in the source will result in a new row being added to a Type 2 table for a given entity. When Type 1 data changes, Caboodle stores only the current value in all existing rows for the entity. Quiz :OVERALL-IsCurrent - Answer :The IsCurrent column is a flag that stores the value of 1 if the row holds the most current information and 0 otherwise. Quiz :OVERALL-Enforcing Referential Integrity - Answer :Caboodle uses three mechanisms for enforcing referential integrity: · Every table in Caboodle contains three rows with surrogate keys of ‐1, ‐2, and ‐3. These rows serve as the destination for lookup columns in other tables. · When source data is unavailable to populate a lookup column, Caboodle supplies a default value of ‐1, ‐2, or ‐3. Which value is used depends on the circumstance behind the missing information. · When data loaded in one Caboodle table is more up‐to‐date than data in another table, the Caboodle ETL process generates an inferred row in the delayed table. This row represents the entity being referenced by the more up‐ to‐date table. Inferred rows act as place holders until the relevant information is loaded into Caboodle.