Big Data 1/6 Question And Answers (Quizzes with Answers 2024/2023)

Which of the following is an example of big data utilized in action today?
- Individual, Unconnected Hospital Databases
- Social Media
- Wi-Fi Networks
- The Internet
- Correct Answer ✅Social Media

What reasoning was given for the following: why is the "data storage to price ratio" relevant to big data?
- Companies can't afford to own, maintain, and spend the energy to support large data storage unless the cost is sufficiently low.
- Larger storage means easier accessibility to big data for every user because it allows users to download in bulk.
- Lower prices mean larger storage becomes easier to access for everyone, creating bigger amounts of data for client-facing services to work with.
- It isn't, it was just an arbitrary example of big data usage.
- Correct Answer ✅Larger storage means easier accessibility to big data for every user because it allows users to download in bulk.

What is the best description of personalized marketing enabled by big data?
- Marketing to each customer on an individual level and suiting to their needs.
- Being able to use personalized data from every single customer for personalized marketing needs.
- Being able to obtain and use customer information for groups of consumers and utilize them for marketing needs.
- Correct Answer ✅Being able to use personalized data from every single customer for personalized marketing needs.

Of the following, which is an example of personalized marketing related with big data?
- Google ordering ads to show items based on recent and past search results.
- A survey that asks your age and markets to you a specific brand.
- News outlets gathering information from the internet in order to report them to the public.
- Correct Answer ✅Google ordering ads to show items based on recent and past search results.

What is the workflow for working with big data?
- Theory -> Models -> Precise Advice
- Extrapolation -> Understanding -> Reproducing
- Big Data -> Better Models -> Higher Precision
- Correct Answer ✅Big Data -> Better Models -> Higher Precision

Where does the real value of big data often come from?
- Having data-enabled decisions and actions from the insights of new data.
- Using the three major data sources: Machines, People, and Organizations.
- Combining streams of data and analyzing them for new insights.
- Size of the data.
- Correct Answer ✅Combining streams of data and analyzing them for new insights.

What does it mean for a device to be "smart"?
- Must have a way to interact with the user.
- Having a specific processing speed in order to keep up with the demands of data processing.
- Connect with other devices and have knowledge of the environment.
- Correct Answer ✅Connect with other devices and have knowledge of the environment.

What does the term "in situ" mean in the context of big data?
- The sensors used in airplanes to measure altitude.
- Accelerometers.
- Bringing the computation to the location of the data.
- In the situation
- Correct Answer ✅Bringing the computation to the location of the data.

Which of the following are reasons mentioned for why data generated by people are hard to process? Choose all that apply.
- The velocity of the data is very high
- Skilled people to analyze the data are hard to come by.
- Very unstructured data.
- They cannot be modeled and stored.
- Correct Answer ✅The velocity of the data is very high; Very unstructured data.

What is the purpose of retrieval and storage; pre-processing; and analysis in order to convert multiple data sources into valuable data?
- To enable ETL methods.
- Designed to work like the ETL process.
- To allow scalable analytical solutions to big data.
- Since the multi-layered process is built into the Neo4j database connection.
- Correct Answer ✅To allow scalable analytical solutions to big data.

Which of the following are benefits of organization-generated data? Choose all that apply.
- Improved Safety
- Higher Sales
- Customer Satisfaction
- Better Profit Margins
- High Velocity
- Correct Answer ✅Improved Safety; Higher Sales; Customer Satisfaction; Better Profit Margins

What are data silos and why are they bad?
- A giant centralized database to house all the data produced within an organization. Bad because it is hard to maintain as highly structured data.
- Highly unstructured data. Bad because it does not provide meaningful results for organizations.
- A giant centralized database to house all the data production within an organization. Bad because it hinders opportunity for data generation.
- Data produced from an organization that is spread out. Bad because it creates unsynchronized and invisible data.
- Correct Answer ✅Data produced from an organization that is spread out. Bad because it creates unsynchronized and invisible data.

Which of the following are benefits of data integration? Choose all that apply.

What is the veracity of big data?
- The size of the data.
- The connectedness of data.
- The abnormality or uncertainties of data.
- The speed at which data is produced.
- Correct Answer ✅The abnormality or uncertainties of data.

What are the challenges of data with high variety?
- Hard to perform emergent behavior analysis.
- Hard to integrate.
- The quality of data is low.
- Hard in utilizing group event detection
- Correct Answer ✅Hard to integrate.

Which of the following is the best way to describe why it is crucial to process data in real-time?
- Prevents missed opportunities.
- More accurate.
- Batch processing is an older method that is not as accurate as real-time processing.
- More expensive to batch process.
- Correct Answer ✅Prevents missed opportunities.

What are the challenges with big data that has high volume?
- Effectiveness and Cost
- Speed Increase in Processing
- Cost, Scalability, and Performance
- Storage and Accessibility
- Correct Answer ✅Cost, Scalability, and Performance

Which of the following are parts of the 5 P's of data science and what is the additional P introduced in the slides?
- Process
- Programmability
- Product
- Purpose
- Platforms
- Perception
- People
- Correct Answer ✅Process; Programmability; Product; Purpose; Platforms; People

Which of the following are part of the four main categories to acquire, access, and retrieve data?
- Remote Data
- Traditional Databases
- Web Services
- Text Files
- NoSQL Storage
- Correct Answer ✅Remote Data; Traditional Databases; Text Files; NoSQL Storage

What are the steps required for data analysis?
- Select Technique, Build Model, Evaluate
- Classification, Regression, Analysis
- Regression, Evaluate, Classification
- Investigate, Build Model, Evaluate
- Correct Answer ✅Select Technique, Build Model, Evaluate

Of the following, which is a technique mentioned in the videos for building a model?
- Analysis

What is done to the data in the preparation stage?
- Understand Nature of Data and Preliminary Analysis.
- Retrieve Data
- Select Analytical Techniques
- Build Models
- Identify Data Sets and Query Data
- Correct Answer ✅Understand Nature of Data and Preliminary Analysis.

Which of the following is the best description of why it is important to learn about the foundations for big data?
- Foundations help you revisit calculus concepts required in the understanding of big data.
- Foundations allow for the understanding of practical concepts in Hadoop.
- Foundations is all that is required to show a mastery of big data concepts.
- Foundations stand the test of time.
- Correct Answer ✅Foundations allow for the understanding of practical concepts in Hadoop.

What is the benefit of a commodity cluster?
- Prevents network connection failure
- Much faster than a traditional super computer
- Enables fault tolerance
- Prevents individual component failures
- Correct Answer ✅Enables fault tolerance

What is a way to enable fault tolerance?
- Better LAN Connection
- Data-Parallel Job Restart
- System Wide Restart
- Distributed Computing
- Correct Answer ✅Data-Parallel Job Restart

What are the specific benefit(s) to a distributed file system?
- High Concurrency
- High Fault Tolerance
- Data Scalability
- Large Storage
- Correct Answer ✅High Concurrency; High Fault Tolerance; Data Scalability

Which of the following are general requirements for a programming language in order to support big data models?
- Handle Fault Tolerance
- Utilize Map Reduction Methods
- Support Big Data Operations
- Enable Adding of More Racks
- Optimization of Specific Data Types
- Correct Answer ✅Handle Fault Tolerance; Support Big Data Operations; Enable Adding of More Racks; Optimization of Specific Data Types

What does IaaS provide?
- Computing Environment
- Hardware Only
- Software On-Demand
- Correct Answer ✅Hardware Only

What does PaaS provide?
- Software On-Demand
- Computing Environment
- Hardware Only
- Correct Answer ✅Computing Environment

What does SaaS provide?
- Software On-Demand
- Hardware Only
- Computing Environment
- Correct Answer ✅Software On-Demand

- Advanced Algorithms
- Data Level Parallelism
- Random Data Access
- Infrastructure Replacement
- Task Level Parallelism
- Correct Answer ✅Advanced Algorithms; Random Data Access; Infrastructure Replacement; Task Level Parallelism

As covered in the slides, which of the following are the major goals of Hadoop?
- Handle Fault Tolerance
- Latency Sensitive Tasks
- Enable Scalability
- Provide Value for Data
- Optimized for a Variety of Data Types
- Facilitate a Shared Environment
- Correct Answer ✅Handle Fault Tolerance; Latency Sensitive Tasks; Enable Scalability; Provide Value for Data; Optimized for a Variety of Data Types; Facilitate a Shared Environment

What is the purpose of YARN?
- Allows various applications to run on the same Hadoop cluster.
- Enables large scale data across clusters.
- Implementation of Map Reduce.
- Correct Answer ✅Allows various applications to run on the same Hadoop cluster.

What are the two main components for a data computation framework that were described in the slides?
- Resource Manager and Container
- Resource Manager and Node Manager
- Node Manager and Applications Master
- Node Manager and Container
- Applications Master and Container
- Correct Answer ✅Resource Manager and Node Manager