












ALBERTA DATA ANALYST EXAM QUESTIONS AND CORRECT (VERIFIED) ANSWERS PLUS RATIONALE 2026 Q&A













1 – 10: Data Fundamentals
1. Data analysis is used to:
A. Build hardware
B. Interpret and draw insights from data
C. Design websites
D. Write operating systems
Answer: B
Rationale: Data analysis transforms raw data into meaningful insights.

2. Qualitative data is:
A. Numeric data
B. Non-numeric data
C. Encrypted data
D. Random data
Answer: B
Rationale: Descriptive data like text or categories.

3. Quantitative data is:
A. Text data
B. Numeric data
C. Image data
D. Audio data
Answer: B
Rationale: Measurable numerical values.

4. Data cleaning refers to:
A. Deleting database
B. Fixing errors and inconsistencies
C. Encrypting data
D. Compressing files
Answer: B
Rationale: Improves data quality.

5. Dataset is:
A. A single number
B. Collection of data
C. A program
D. A server
Answer: B
Rationale: Organized collection of data.

6. Data visualization means:
A. Coding
B. Graphical representation of data
C. Encryption
D. Storage
Answer: B
Rationale: Charts and graphs represent data.

7. Big Data refers to:
A. Small datasets
B. Extremely large datasets
C. Single file data
D. Random data
Answer: B
Rationale: Large complex datasets.

8. Data integrity ensures:
A. Faster internet
B. Accuracy and consistency
C. File compression
D. UI design

Answer: B
Rationale: All formulas begin with an equals sign.

13. SUM function is used to:
A. Multiply numbers
B. Add values
C. Divide numbers
D. Sort data
Answer: B
Rationale: Adds a range of values.

14. VLOOKUP is used to:
A. Visualize charts
B. Search data vertically
C. Delete rows
D. Encrypt data
Answer: B
Rationale: Searches the first column of a range and returns a value from the matching row.

15. Pivot table is used for:
A. Gaming
B. Summarizing data
C. Coding
D. Storage
Answer: B
Rationale: Aggregates large datasets.

16. Chart in Excel is used for:
A. Programming
B. Visualization
C. Encryption
D. Debugging
Answer: B
Rationale: Represents data graphically.

17. Cell reference A1 means:
A. Column A, row 1
B. Row A, column 1
C. File name
D. Formula
Answer: A
Rationale: Excel coordinate system.

18. Filtering data means:
A. Deleting file
B. Showing selected data
C. Encrypting data
D. Compressing file
Answer: B
Rationale: Displays specific rows.

19. Sorting arranges data:
A. Randomly
B. Ascending/descending order
C. Encrypted
D. Deleted
Answer: B
Rationale: Organizes data order.

20. Absolute reference uses:
A. A
B. $A$1
C. A1:A
D. =A
Answer: B
Rationale: The dollar signs lock both the column and the row.

21 – 30: SQL Basics

B. Records
C. Tables
D. Server
Answer: B
Rationale: Deletes data rows.

26. WHERE clause is used for:
A. Sorting
B. Filtering
C. Grouping
D. Joining
Answer: B
Rationale: Filters results.

27. PRIMARY KEY is:
A. Duplicate value
B. Unique identifier
C. Index
D. Foreign key
Answer: B
Rationale: Uniquely identifies records.

28. FOREIGN KEY connects:
A. Files
B. Tables
C. Servers
D. Networks
Answer: B
Rationale: Links tables together.

29. JOIN is used to:
A. Encrypt data
B. Combine tables
C. Delete data
D. Sort files
Answer: B
Rationale: Merges related tables.

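The ideas in questions 26 to 29 can be tried out with a small sketch using Python's built-in sqlite3 module. The `customers` and `orders` tables, their columns, and the sample rows are hypothetical and chosen only for illustration; other database engines may differ in syntax and in how they enforce keys.

```python
import sqlite3

# In-memory database; all table names, columns, and rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT NOT NULL)")
conn.execute(
    """CREATE TABLE orders (
           id INTEGER PRIMARY KEY,
           customer_id INTEGER NOT NULL REFERENCES customers(id),
           amount REAL NOT NULL
       )"""
)
conn.executemany(
    "INSERT INTO customers (id, name) VALUES (?, ?)",
    [(1, "Ana"), (2, "Ben")],
)
conn.executemany(
    "INSERT INTO orders (id, customer_id, amount) VALUES (?, ?, ?)",
    [(1, 1, 50.0), (2, 1, 120.0), (3, 2, 80.0)],
)

# JOIN combines the two tables through the key relationship;
# WHERE keeps only the rows that satisfy the condition.
large_orders = conn.execute(
    """SELECT c.name, o.amount
       FROM orders AS o
       JOIN customers AS c ON c.id = o.customer_id
       WHERE o.amount > ?
       ORDER BY o.amount""",
    (75,),
).fetchall()

# A PRIMARY KEY must be unique, so reusing id 1 is rejected.
try:
    conn.execute("INSERT INTO customers (id, name) VALUES (1, 'Duplicate')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# A FOREIGN KEY must point at an existing row in the referenced table.
try:
    conn.execute("INSERT INTO orders (id, customer_id, amount) VALUES (4, 99, 10.0)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

print(large_orders)
print(duplicate_rejected, orphan_rejected)
```

Here the query returns only the orders above 75 together with the customer name taken from the joined table, and both invalid inserts raise an integrity error.
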
30. COUNT function returns:
A. Text
B. Number of rows
C. Average
D. Sum
Answer: B
Rationale: Counts records.

31 – 40: Statistics

38. Outlier is:
A. Normal value
B. Extreme value
C. Average
D. Median
Answer: B
Rationale: Unusual data point.

39. Probability is:
A. Certainty
B. Likelihood of event
C. Error
D. Data type
Answer: B
Rationale: Chance of occurrence.

40. Sample is:
A. Entire population
B. Subset of population
C. Error data
D. Database
Answer: B
Rationale: A part of the population selected for analysis.

41 – 50: Python for Data Analysis

42. Pandas is used for:
A. Web design
B. Data manipulation
C. Networking
D. Encryption
Answer: B
Rationale: Handles structured data efficiently.

43. NumPy is used for:
A. Images only
B. Numerical computation
C. UI design
D. APIs
Answer: B
Rationale: Efficient array processing.

44. Matplotlib is used for:
A. Databases
B. Data visualization
C. Servers
D. Security
Answer: B
Rationale: Creates graphs and charts.

45. Seaborn is built on:
A. SQL
B. Matplotlib
C. Java
D. HTML
Answer: B
Rationale: Enhances statistical plotting.

46. Jupyter Notebook is used for:
A. Networking

Answer: B
Rationale: Maps keys to values.

51 – 60: Data Cleaning & Preparation

51. Data cleaning means:
A. Encrypting data
B. Fixing errors and inconsistencies
C. Deleting database
D. Compressing files
Answer: B
Rationale: Improves data quality.

52. Missing data should be:
A. Ignored always
B. Handled properly
C. Duplicated
D. Encrypted
Answer: B
Rationale: Unhandled missing values can distort analysis.

53. Outlier is:
A. Normal value
B. Extreme value
C. Average value
D. Median
Answer: B
Rationale: Data point far from others.

54. Duplicate data causes:
A. Better accuracy
B. Bias in analysis
C. Faster processing
D. Security
Answer: B
Rationale: Skews results.

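As a rough illustration of questions 52 to 54, the sketch below removes duplicate rows, fills a missing value, and flags an outlier using only the Python standard library. The records are made up, and filling with the median and using the 1.5 × IQR rule are assumptions chosen for the example; they are common conventions, not the only valid ways to handle missing data or outliers.

```python
from statistics import median, quantiles

# Hypothetical raw records as (id, value); None marks a missing value.
raw = [(1, 10.0), (2, 12.0), (2, 12.0), (3, None), (4, 11.0), (5, 13.0), (6, 95.0)]

# Step 1: drop exact duplicate rows, keeping the first occurrence.
seen = set()
deduped = []
for row in raw:
    if row not in seen:
        seen.add(row)
        deduped.append(row)

# Step 2: fill missing values with the median of the observed values.
observed = [value for _, value in deduped if value is not None]
fill_value = median(observed)
filled = [(rid, fill_value if value is None else value) for rid, value in deduped]

# Step 3: flag values outside 1.5 * IQR from the quartiles as outliers.
values = [value for _, value in filled]
q1, _, q3 = quantiles(values, n=4)
iqr = q3 - q1
low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [(rid, value) for rid, value in filled if value < low or value > high]

print(len(raw), "rows ->", len(deduped), "after removing duplicates")
print("fill value:", fill_value)
print("outliers:", outliers)
```

Whether a flagged outlier should be removed, capped, or kept depends on the analysis; the sketch only identifies it.
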
55. Data transformation means:
A. Deleting data
B. Converting data format
C. Encrypting data
D. Compressing images
Answer: B
Rationale: Changes structure or format.

56. Normalization in data means:
A. Increasing size
B. Scaling values
C. Removing database
D. Encrypting files
Answer: B
Rationale: Rescales values to a common range, commonly 0 to 1.

57. Standardization converts data to:
A. Random format
B. Mean = 0, SD = 1
C. Text only
D. Binary only
Answer: B
Rationale: Rescales values to z-scores with mean 0 and standard deviation 1.

58. Feature engineering is:
A. Deleting columns
B. Creating useful variables
C. Encrypting data
D. Sorting only
Answer: B
Rationale: Improves model performance.

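The difference between normalization (question 56) and standardization (question 57) is easier to see with numbers. The following is a minimal sketch, assuming min-max scaling for normalization and the population standard deviation for standardization; the function names and the `heights_cm` values are hypothetical.

```python
from statistics import mean, pstdev

def min_max_normalize(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Convert values to z-scores (mean 0, standard deviation 1)."""
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (an assumption)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]

heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]  # made-up sample data

normalized = min_max_normalize(heights_cm)
z_scores = standardize(heights_cm)

print(normalized)  # [0.0, 0.25, 0.5, 0.75, 1.0]
print([round(z, 3) for z in z_scores])
```

Normalization keeps the values inside a fixed range, while standardization centers them on zero and expresses each value as a number of standard deviations from the mean.
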
63. Mode is:
A. Average
B. Most frequent value
C. Random value
D. Outlier
Answer: B
Rationale: Most common value.

64. Standard deviation measures:
A. Count
B. Data spread
C. Average
D. Mode
Answer: B
Rationale: Variability in dataset.

65. Correlation measures:
A. Causation
B. Relationship between variables
C. Errors
D. Size
Answer: B
Rationale: Strength and direction of the relationship.

66. Positive correlation means:
A. One increases, other decreases
B. Both increase together
C. No relation
D. Random
Answer: B
Rationale: Direct relationship.

67. Probability is:
A. Certainty
B. Likelihood of event
C. Error
D. Dataset
Answer: B
Rationale: Chance of occurrence.

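The measures in questions 63 to 67 can be computed by hand on tiny datasets. The sketch below is one way to do it with the Python standard library; all sample values are invented, the standard deviation shown is the population version, and `pearson_r` is a hypothetical helper written out for illustration.

```python
from math import sqrt
from statistics import mean, mode, pstdev

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Mode: the most frequent value.
most_common = mode([3, 5, 5, 7, 9])

# Standard deviation: how far values spread around the mean.
spread = pstdev([2, 4, 4, 4, 5, 5, 7, 9])

# Positive correlation: both variables increase together.
hours_studied = [1, 2, 3, 4, 5]
exam_scores = [52, 55, 61, 64, 70]
r_positive = pearson_r(hours_studied, exam_scores)

# Negative correlation: one increases while the other decreases.
r_negative = pearson_r([1, 2, 3, 4, 5], [10, 8, 6, 4, 2])

# Probability: favorable outcomes divided by all equally likely outcomes.
die_faces = [1, 2, 3, 4, 5, 6]
p_even = len([f for f in die_faces if f % 2 == 0]) / len(die_faces)

print(most_common, spread, round(r_positive, 3), round(r_negative, 3), p_even)
```

A coefficient near +1 or -1 indicates a strong linear relationship, but as option A in question 65 suggests, it says nothing about causation.
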
68. Regression is used for:
A. Classification
B. Predicting continuous values
C. Sorting
D. Cleaning data
Answer: B
Rationale: Numeric prediction.

69. Classification predicts:
A. Numbers
B. Categories
C. Storage
D. Errors
Answer: B
Rationale: Assigns labels.

70. Time series data is:
A. Random data
B. Data over time
C. Static data
D. Image data
Answer: B
Rationale: Chronological dataset.

71 – 80: Business Intelligence & Tools

Answer: B
Rationale: Uses data for decisions.

76. Insight means:
A. Raw data
B. Meaningful conclusion
C. Error
D. File
Answer: B
Rationale: Interpreted result.

77. Data storytelling is:
A. Writing code
B. Communicating insights clearly
C. Encrypting data
D. Sorting files
Answer: B
Rationale: Explains data meaning.

78. Dashboard KPI tracking helps:
A. Increase errors
B. Monitor performance
C. Delete data
D. Encrypt files
Answer: B
Rationale: Tracks business health.

79. Data analyst role includes:
A. Hardware repair
B. Interpreting data
C. Network setup
D. UI design only
Answer: B
Rationale: Extracts insights from data.

80. Final goal of data analysis is:
A. Complex charts
B. Actionable insights for decision-making
C. Random data
D. Storage expansion
Answer: B
Rationale: Supports business decisions.