































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
D467 - EXPLORING DATA MIDTERM ASSESSMENT EXAM LATEST / D467 - EXPLORING DATA ASSESSMENT EXAM REAL EXAM 70 QUESTIONS AND ANSWERS|PASSED WITH A+ What are cookies? Types of malware that can damage computers Small files stored on computers that contain information about users Programs that enable users to access websites Pieces of code that store information about a website - CORRECT ANSWER >>>Small files stored on computers that contain information about users Fill in the blank: For data analytics projects, _____ data is typically preferred because users know it originated within the organization.
Typology: Exams
1 / 39
This page cannot be seen from the preview
Don't miss anything!
































What are cookies? Types of malware that can damage computers Small files stored on computers that contain information about users Programs that enable users to access websites Pieces of code that store information about a website - CORRECT ANSWER >>> Small files stored on computers that contain information about users Fill in the blank: For data analytics projects, _____ data is typically preferred because users know it originated within the organization. second-party third-party multi-party first-party
- CORRECT ANSWER >>> first-party A grocery store chain purchases customer data from a credit card company. The grocer uses this data to identify its most loyal customers and offer them special promotions and discounts. What type of data is being used in this scenario? First-party Multi-party Third-party
Second-party - CORRECT ANSWER >>> Second-party In data analytics, what term refers to all possible data values in a dataset? Source Representation Population Sample - CORRECT ANSWER >>> Population An entertainment website displays a star rating for a movie based on user reviews. Users can select from one to five whole stars to rate the movie. The star rating is an example of what type of data? Select all that apply. Continuous Discrete Ordinal Nominal - CORRECT ANSWER >>> Discrete Ordinal What type of data is the height of a skyscraper? Discrete Qualitative Nominal Continuous - CORRECT ANSWER >>> Continuous
Subject Character Point - CORRECT ANSWER >>> Field Fill in the blank: A data type is a specific kind of data _____ that tells what kind of value the data is. attribute frame model point - CORRECT ANSWER >>> attribute What are the key characteristics of a text, or string, data type? Select all that apply. Contains textual information Only two possible values Sequence of characters and punctuation Has numerical percentages - CORRECT ANSWER >>> Contains textual information Sequence of characters and punctuation In a data table, where are fields contained? Rows Columns Favorites Charts - CORRECT ANSWER >>> Columns
When using long data, each subject has data in multiple rows. This is because each row represents what? Data in different formats True or false data points One observation per subject Multiple values - CORRECT ANSWER >>> One observation per subject What strategy do data professionals use in order to ensure unbiased sampling? Use random sampling during data collection Write survey questions that encourage specific responses Store data in a spreadsheet Skew results in a certain direction - CORRECT ANSWER >>> Use random sampling during data collection Fill in the blank: Bias is a _____ preference in favor of or against a person, group of people, or thing. conscious or subconscious sensible or insensible fair or unfair standard or substandard - CORRECT ANSWER >>> conscious or subconscious Which of the following are examples of sampling bias? Select all that apply.
Fill in the blank: Data is considered _____ when it is accurate, complete, and unbiased information that has been vetted and proven fit for use. original current comprehensive reliable - CORRECT ANSWER >>> reliable Which of the following are usually good data sources? Select all that apply. Vetted public datasets Social media sites Governmental agency data Academic papers - CORRECT ANSWER >>> Vetted public datasets Governmental agency data Academic papers To determine if a data source is cited, ask which of the following questions? Select all that apply. When was this data last refreshed? Who created this dataset? Is this dataset from a credible organization? Has this dataset been properly cleaned? - CORRECT ANSWER >>> When was this data last refreshed? Who created this dataset? Is this dataset from a credible organization?
A junior data analyst learns that the dataset they have been given is six years old. After looking into this further, they also discover that the age of the data is making the information irrelevant to their project. What good data source principle have they used to evaluate the dataset? Comprehensive Original Reliable Current - CORRECT ANSWER >>> Reliable What are data ethics? Established methods for ensuring data is clean, well-organized, and appropriate for a project Long-standing techniques for confirming that data is always used to benefit society Approved strategies data professionals use to safeguard the privacy and security of a dataset Well-founded standards of right and wrong that dictate how data is collected, shared, and used
- CORRECT ANSWER >>> Well-founded standards of right and wrong that dictate how data is collected, shared, and used What concept states that all data-processing activities and algorithms should be completely explainable and understood by the individual who provides their data? Ownership Currency Privacy Transaction transparency - CORRECT ANSWER >>> Transaction transparency A data analyst removes personally identifying information from a dataset. What task are they performing?
What is the preferred method for open data to be made available? A convenient and modifiable internet download A secure password-protected file A compressed file format that keeps file size small A print copy that is easily shared by anyone - CORRECT ANSWER >>> A convenient and modifiable internet download What are the main benefits of open data? Select all that apply. Combines data from different fields of knowledge Good data is more widely available Restricts data access to certain groups of people Increases the amount of data available for purchase - CORRECT ANSWER >>> Combines data from different fields of knowledge Good data is more widely available What are the key aspects of universal participation? Select all that apply. Certain groups of people must share their private data. No one can place restrictions on data to discriminate against a person or group. Everyone must be able to use, reuse, and redistribute open data. All corporations are allowed to sell open data. - CORRECT ANSWER >>> No one can place restrictions on data to discriminate against a person or group. Everyone must be able to use, reuse, and redistribute open data.
Freedom from inappropriate use of your data is an element of which aspect of data ethics? Consent Transparency Privacy Currency - CORRECT ANSWER >>> Consent A data professional working on a project about commuters researches the origin of a dataset to confirm it was created by a reputable source, such as a government transportation agency. Which aspect of good data are they prioritizing? Original Comprehensive Cited Reliable - CORRECT ANSWER >>> Cited A hospital system wants to protect the personally identifiable information of its patients, such as names and medical records. They ask their data team to anonymize the data. What techniques might they use to achieve this goal? Hashing Masking Sorting Blanking - CORRECT ANSWER >>> Masking Blanking
free access data-processing activities raw data financial transactions - CORRECT ANSWER >>> data-processing activities A government agency allows any business, nonprofit, or citizen to access its databases and reuse or redistribute the data. What type of data is described in this scenario? Closed Allowable Free Open - CORRECT ANSWER >>> Open An investor with a background working in the tech industry interprets any pitch from a tech startup as being more promising than others, even if the information is confusing and ambiguous. What type of bias does this scenario describe? Sampling Interpretation Observer Confirmation - CORRECT ANSWER >>> Interpretation A magazine conducts research about people's reading preferences. They only include respondents who currently subscribe. What type of bias does this scenario describe? Confirmation Interpretation Sampling
Observer - CORRECT ANSWER >>> Sampling Fill in the blank: The data ethics principle of _____ states that an individual has the right to understand all of the data-processing activities and algorithms used on their data. transaction transparency consent ownership currency - CORRECT ANSWER >>> transaction transparency A financial institution publishes data about stock prices and market trends, which any business, nonprofit, or citizen can access, reuse, or redistribute through its online databases. What type of data is described in this scenario? Open Allowable Free Closed - CORRECT ANSWER >>> Open Fill in the blank: A relational database contains a series of _____ that can be connected to form relationships. tables cells fields spreadsheets - CORRECT ANSWER >>> tables What is the term for an identifier that references a database column in which each value is unique?
A large metropolitan high school gives each of its students an ID number to differentiate them in its database. What kind of metadata are the ID numbers? Administrative Structural Descriptive Representative - CORRECT ANSWER >>> Descriptive An international nonprofit organization wants to merge third-party data with its own data. Which of the following actions will help make this process successful? Select all that apply. Use metadata to standardize the datasets. Replace the incoming data's metadata with its own company metadata. Use metadata to evaluate the third-party data's quality and credibility. Alter the internal metadata to more closely reflect the incoming metadata. - CORRECT ANSWER >>> Use metadata to standardize the datasets Use metadata to evaluate the third-party data's quality and credibility Fill in the blank: Data _____ is a process data professionals use to ensure the formal management of their organization's data assets. sourcing governance organization storage - CORRECT ANSWER >>> governance What are some key benefits of open-data initiatives? Select all that apply.
Limit opportunities for collaboration Make government activities more transparent Support innovation and economic growth Help educate citizens about important issues - CORRECT ANSWER >>> Make government activities more transparent Support innovation and economic growth Help educate citizens about important issues What type of file saves data in a table format? Calculated spreadsheet values (.csv) Comma-separated values (.csv) Cell-structured variables (.csv) Compatible scientific variables (.csv) - CORRECT ANSWER >>> Comma-separated values (.csv) Bringing data from a .csv file into a spreadsheet is an example of what process? Filing data Importing data Editing data Normalizing data - CORRECT ANSWER >>> Importing data In Google Sheets, what function enables a data analyst to specify a range of cells in one spreadsheet to be duplicated in another? SPECIFY
By car numerical ID, in descending order - CORRECT ANSWER >>> By return date, in descending order Fill in the blank: To keep a header row at the top of a spreadsheet, highlight the row and select _____ from the View menu. set lock freeze pin - CORRECT ANSWER >>> freeze Which statement is true about sampling, irrespective of sample size? The sample standard deviation (Stdev) is the same as the population Stdev. The sample distribution approximates to normal distribution. The sample bias is reduced if the sample being selected is representative of the population. The sample mean approaches the population mean - CORRECT ANSWER >>> The sample bias is reduced if the sample being selected is representative of the population. A data analyst is performing analysis on data stored in a big data platform with state-of-the-art analysis tools. Insufficient sample size is rendering the current analysis ineffective. What is the primary challenge of a larger sample size? Collecting a larger sample size is more expensive. Analyzing a larger sample size is complex. Storing a larger sample size is difficult. Cleansing a larger sample size is complicated. - CORRECT ANSWER >>> Collecting a larger sample size is more expensive.
In the next six months, an analyst is expected to analyze and present the effects of monthly promotions on sales of a new product released one month ago. Which solution for insufficient data should this analyst pursue? Look for a new data set Identify trends with available data Speak with stakeholders and adjust the objective Wait for more data - CORRECT ANSWER >>> Wait for more data A team leader is assigned the task of evaluating the schema of a data set as part of data cleansing. How would the team leader define a schema to the analyst collaborating on the project prior to commencing cleansing? How well two or more data sets work together A way of describing how something is organized A process of combining two or more data sets into a single data set A way of matching fields in separate databases - CORRECT ANSWER >>> A way of describing how something is organized An analyst performing data cleansing on invoice data would like to select and view rows that have an amount paid that is greater than $100. Which spreadsheet functionality should the analyst use? COUNTIF Conditional formatting Filter Remove duplicates - CORRECT ANSWER >>> Filter