


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Main points of this past exam are: Big Data, Big Data, Technology Sector, Specific Challenges, Parallelized, Big Data Technologies, Apache Hadoop Project, Current Market Landscape, Contributing Vendors, Hadoop Ecosystem
Typology: Exams
1 / 4
This page cannot be seen from the preview
Don't miss anything!



On special offer
Semester 1 Examinations 2012/
Note to Candidates: Please check the Programme Title and the Module Title to ensure that you are attempting the correct examination. If in doubt please contact an Invigilator.
Section A
(Both questions are mandatory)
(a) Discuss in detail the trend of big data in the technology sector and other diverse sectors (supported with relevant statistics) along with the specific challenges that “big data” presents and why/how cloud computing is addressing these.
(b) State briefly the motivation for distributed and parallelized big data technologies and provide a detailed description of the current market landscape choosing the Apache Hadoop project as an example i.e. contributing vendors, distributions etc. [12]
Total 30 Marks
Describe in detail the following:
(a) The structure and operation of the MapReduce paradigm illustrating with a detailed example. [10] (b) The structure and operation of HDFS illustrating with a detailed example. [10] (c) The extended Hadoop ecosystem, its sub projects and their purpose. [4] (d) Choosing the Mahout distributed machine learning library, list the algorithms it offers, stating their purpose and provide a brief example of where these could be used using big data sets. [6]
Total 30 Marks
(i) Perform a hierarchical cluster analysis using the distances between cities above. [10]
(ii) Draw the corresponding dendrogram and recommend two cities in which to situate warehouses. [5]
(iii) Briefly list the limitations of such analysis. [5]
Total 20 marks