
































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
This exam certifies professionals in data-driven consulting at the Silver level. It covers data analysis, visualization, KPI development, business intelligence, and storytelling with data. Candidates must demonstrate the ability to apply analytics to consulting engagements, supporting decision-making and strategy alignment with measurable insights.
Typology: Exams
1 / 72
This page cannot be seen from the preview
Don't miss anything!

































































Question 1. Which principle ensures that data requests are structured without overlap and cover all necessary areas? A) Pareto Principle B) MECE Principle C) KISS Principle D) SWOT Framework Answer: B Explanation: MECE (Mutually Exclusive, Collectively Exhaustive) guarantees that data requests are distinct and together address the full scope. Question 2. In a data audit, which dimension assesses whether data values are up-to-date? A) Accuracy B) Completeness C) Consistency D) Timeliness Answer: D Explanation: Timeliness measures how current the data is, critical for real-time decision making. Question 3. Which stakeholder group is most interested in high-level business impact rather than operational details? A) Front-line employees B) Middle managers C) C-suite executives D) Data engineers Answer: C Explanation: C-suite leaders focus on strategic outcomes and ROI, not day-to-day metrics. Question 4. Under GDPR, which of the following is a lawful basis for processing personal data? A) Commercial interest B) Consent
C) Data mining D) Predictive analytics Answer: B Explanation: Consent is one of the six lawful bases defined by GDPR for processing personal data. Question 5. Which step of ETL is responsible for converting dates from “MM/DD/YYYY” to ISO “YYYY-MM-DD” format? A) Extract B) Transform C) Load D) Validate Answer: B Explanation: Transformation cleanses and reshapes data, including format conversion. Question 6. When profiling a dataset, a high standard deviation relative to the mean most likely indicates: A) Many missing values B) Data duplication C) Presence of outliers D) Uniform distribution Answer: C Explanation: A large spread (high SD) suggests extreme values affecting the distribution. Question 7. Which schema design improves query performance for analytical reporting? A) 3NF normalization B) Star schema denormalization C) OLTP transactional schema D) Snowflake schema with high normalization Answer: B
C) Logistic regression D) Ridge regression Answer: C Explanation: Logistic regression models the probability of a categorical binary outcome. Question 12. In time-series analysis, the component that repeats every 12 months is called: A) Trend B) Seasonality C) Cycle D) Noise Answer: B Explanation: Seasonality captures regular patterns that recur at fixed intervals, such as yearly. Question 13. A “what-if” model that changes churn rate by ±5% while holding other variables constant is an example of: A) Monte Carlo simulation B) Scenario analysis C) Regression analysis D) Decision tree analysis Answer: B Explanation: Scenario analysis evaluates outcomes under alternative assumptions for a single variable. Question 14. When calculating ROI for a data-driven project, which denominator correctly reflects the investment? A) Net profit after implementation B) Total revenue increase C) Initial project cost D) Annual operating expense Answer: C
Explanation: ROI = (Gain – Cost) / Cost; the denominator is the initial investment. Question 15. Which metric is most useful for valuing a SaaS company based on operational data? A) EBITDA margin B) Customer Lifetime Value (CLV) C) Book value of assets D) Debt-to-Equity ratio Answer: B Explanation: CLV reflects recurring revenue potential, a key driver for SaaS valuations. Question 16. The Pyramid Principle suggests that a presentation should start with: A) Detailed data tables B) A story about the client C) The main recommendation D) The methodology used Answer: C Explanation: The principle advocates leading with the conclusion, then supporting it with evidence. Question 17. Which chart best visualizes the contribution of each product line to total revenue over time? A) Waterfall chart B) Scatter plot C) Heat map D) Pie chart Answer: A Explanation: Waterfall charts show incremental changes and cumulative totals across periods. Question 18. A heat map is most effective for displaying:
Answer: B Explanation: Dataflows allow ETL logic to be shared across multiple Power BI reports. Question 22. Which tool is known for its “Ask Data” natural-language interface? A) Tableau B) Looker C) Power BI D) Qlik Sense Answer: A Explanation: Tableau’s “Ask Data” feature enables users to pose questions in natural language and receive visual responses. Question 23. Snowflake is primarily a: A) On-premises data warehouse B) Cloud-based data platform C) Business intelligence tool D) Statistical analysis software Answer: B Explanation: Snowflake is a cloud-native data warehousing platform designed for scalable data storage and analytics. Question 24. Which low-code/no-code platform is commonly used for automating data workflows? A) Power BI B) Alteryx C) Tableau D) SAS Answer: B Explanation: Alteryx provides drag-and-drop tools for data blending, preparation, and automation without extensive coding.
Question 25. In a typical ETL process, which step involves merging data from multiple sources? A) Extract B) Transform C) Load D) Integrate Answer: B Explanation: Transformation includes data cleaning and merging, often combining data from different sources. Question 26. Which visualization best displays the distribution of a continuous variable? A) Bar chart B) Histogram C) Line chart D) Pie chart Answer: B Explanation: Histograms illustrate the frequency distribution of continuous data. Question 27. Which measure indicates the strength of a linear relationship between two variables? A) Mean B) Variance C) Correlation coefficient D) P-value Answer: C Explanation: The correlation coefficient quantifies the strength and direction of a linear relationship. Question 28. When performing regression analysis, multicollinearity can cause: A) Overfitting B) Inflated standard errors
Explanation: Sensitivity analysis tests how outcomes respond to variations in specific inputs. Question 32. Which visualization type is best suited to compare actual versus forecasted sales over time? A) Pie chart B) Line chart C) Heat map D) Scatter plot Answer: B Explanation: Line charts effectively display trends and comparisons over time. Question 33. The primary goal of data normalization is to: A) Reduce data size for storage efficiency B) Adjust values to a common scale for comparison C) Remove duplicates from datasets D) Handle missing data points Answer: B Explanation: Normalization scales data to facilitate accurate comparisons and analyses. Question 34. Which is a common approach to handle missing data when the proportion is small? A) Imputation with mean or median B) Dropping all rows with missing values C) Ignoring missing data during analysis D) Replacing missing values with zeros Answer: A Explanation: Imputation maintains data integrity when missingness is minimal and random. Question 35. Which metric is most useful for measuring the efficiency of a marketing campaign?
A) Customer Lifetime Value (CLV) B) Return on Investment (ROI) C) Churn Rate D) Net Promoter Score (NPS) Answer: B Explanation: ROI quantifies the financial return relative to campaign costs. Question 36. Which visualization technique is optimal for showing geographic distribution of sales? A) Heat map B) Bar chart C) Line chart D) Box plot Answer: A Explanation: Heat maps visually represent spatial density and distribution across regions. Question 37. Which of the following is a key benefit of dashboard drill-down capabilities? A) Prevents data corruption B) Allows users to explore data in more detail C) Eliminates the need for data cleaning D) Automates data collection Answer: B Explanation: Drill-down features enable stakeholders to explore detailed views underlying summary metrics. Question 38. Which principle advocates starting reports with conclusions supported by data? A) Pareto Principle B) Pyramid Principle C) Occam’s Razor D) Law of Large Numbers
Question 42. In the context of data privacy, “de-identification” refers to: A) Encrypting data for security B) Removing personally identifiable information C) Backing up datasets regularly D) Using anonymized aggregate data Answer: B Explanation: De-identification involves removing or masking PII to protect privacy. Question 43. Which analytical method is suitable for predicting customer churn based on historical data? A) Classification algorithms like logistic regression B) Clustering methods like K-means C) Regression analysis D) Time-series forecasting Answer: A Explanation: Classification models predict categorical outcomes such as churn (yes/no). Question 44. When assessing the financial impact of a data project, which metric captures the total cost including ongoing expenses? A) ROI B) Total Cost of Ownership (TCO) C) Payback period D) Break-even point Answer: B Explanation: TCO accounts for initial investment plus all related ongoing costs. Question 45. Which visualization technique is most appropriate to identify outliers in a dataset? A) Box plot B) Line chart C) Heat map
D) Pie chart Answer: A Explanation: Box plots highlight outliers outside the interquartile range. Question 46. Which of the following best describes the “ethics of algorithmic bias”? A) Ensuring algorithms are fast and efficient B) Preventing unintended discrimination in decision models C) Increasing automation in all processes D) Using only historical data for modeling Answer: B Explanation: Ethical considerations include avoiding algorithms that perpetuate bias or unfairness. Question 47. Which tool is primarily used for creating interactive, actionable dashboards? A) Power BI B) Excel C) Notepad++ D) RStudio Answer: A Explanation: Power BI is designed specifically for building interactive dashboards with drill-downs and filters. Question 48. Which approach helps ensure that data visualizations are accessible to non-technical audiences? A) Using complex statistical language B) Simplifying visuals and highlighting key insights C) Including raw data tables in reports D) Using only text explanations without visuals Answer: B Explanation: Clear, simple visuals with key takeaways improve understanding for all stakeholders.
C) Creating charts D) Importing data from external sources Answer: B Explanation: Pivot tables enable dynamic data summarization and analysis. Question 53. Which is the main purpose of conducting a data audit? A) To develop machine learning models B) To check data quality and readiness for analysis C) To visualize data trends D) To automate data collection Answer: B Explanation: Data audits assess accuracy, completeness, and consistency to ensure reliable analysis. Question 54. Which type of analysis helps identify the causal impact of marketing campaigns? A) Correlation analysis B) Causal inference methods like A/B testing C) Clustering analysis D) Time-series decomposition Answer: B Explanation: Causal inference, such as A/B testing, isolates the effect of specific interventions. Question 55. Which visualization technique is most effective for comparing multiple categories across a single metric? A) Bar chart B) Scatter plot C) Line chart D) Heat map Answer: A Explanation: Bar charts facilitate comparison of categories for a specific measure.
Question 56. What is the primary purpose of a “dashboard” in data analytics? A) Store raw data for backup B) Provide a real-time, visual overview of key metrics C) Automate data cleaning processes D) Replace data warehouses entirely Answer: B Explanation: Dashboards display real-time KPIs for quick decision-making. Question 57. Which statistical test is appropriate for comparing means across more than two groups? A) T-test B) ANOVA C) Chi-square test D) Correlation coefficient Answer: B Explanation: ANOVA tests for significant differences among multiple group means. Question 58. When data is denormalized for reporting purposes, it is typically stored in which schema? A) 3NF schema B) Star schema C) OLTP schema D) Entity-Relationship schema Answer: B Explanation: Denormalized star schemas optimize reporting performance. Question 59. Which of the following is most relevant to the ethical use of data? A) Maximizing data collection regardless of privacy concerns B) Ensuring compliance with privacy laws and user consent C) Avoiding data audits to expedite analysis
Question 63. Which feature of Power BI allows for data to be refreshed automatically at scheduled times? A) Dataflows B) Data refresh settings C) Data modeling D) Power Query Editor Answer: B Explanation: Power BI’s scheduled refresh enables automatic updates of datasets. Question 64. The “Total Cost of Ownership” (TCO) includes which of the following? A) Only initial software licensing fees B) Implementation costs, maintenance, and operational expenses C) Only hardware costs D) Revenue generated by the project Answer: B Explanation: TCO encompasses all costs associated with acquiring, implementing, and maintaining a system. Question 65. Which term describes the process of converting raw data into a format suitable for analysis? A) Data cleaning B) Data modeling C) Data wrangling D) Data visualization Answer: C Explanation: Data wrangling involves transforming raw data into a cleaned, analysis-ready format. Question 66. Which visualization best illustrates the contribution of each part to a total? A) Pie chart
B) Line chart C) Heat map D) Scatter plot Answer: A Explanation: Pie charts display parts of a whole in proportions. Question 67. Which is a common method for identifying seasonality in time- series data? A) Autocorrelation function (ACF) B) Linear regression C) Clustering D) Principal Component Analysis (PCA) Answer: A Explanation: ACF detects repeating patterns indicative of seasonality. Question 68. In a regression model, multicollinearity can cause: A) Increased model interpretability B) Unstable coefficient estimates C) Improved predictive accuracy D) Reduced variance inflation factors (VIF) Answer: B Explanation: Multicollinearity destabilizes coefficient estimates, making them unreliable. Question 69. Which of the following best describes “descriptive analytics”? A) Explaining why something happened B) Predicting future outcomes C) Summarizing past data to understand what happened D) Prescribing actions to improve performance Answer: C Explanation: Descriptive analytics summarizes historical data to understand past events.