This exam evaluates a candidate's ability to develop applications using the Hortonworks Data Platform. Topics include Hadoop, HDFS, Hive, Pig, and MapReduce programming.
Typology: Exams

Question 1. Which HDFS command is used to display a list of files and directories within a specified HDFS directory?
A) hadoop fs -ls
B) hadoop fs -put
C) hadoop fs -cat
D) hadoop fs -rm
Answer: A
Explanation: The hadoop fs -ls command lists files and directories in the specified HDFS directory.

Question 2. What is the default block size in HDFS for Hadoop 2.x and later?
A) 32 MB
B) 64 MB
C) 128 MB
D) 256 MB
Answer: C
Explanation: Hadoop 2.x and later use a default block size of 128 MB to optimize storage and processing.

Question 3. Which command moves a file from the local file system to HDFS?
A) hadoop fs -get
B) hadoop fs -moveFromLocal
C) hadoop fs -put
D) hadoop fs -delete
Answer: B
Explanation: The hadoop fs -moveFromLocal command moves files from the local file system to HDFS, deleting the source file.

Question 4. What does the HDFS command hadoop fs -chmod 755 /data/file.txt do?
A) Deletes the file
B) Changes file permissions to rwxr-xr-x
C) Copies the file
D) Renames the file
Answer: B
Explanation: chmod 755 sets file permissions to read, write, execute for the owner, and read, execute for group and others.

Question 5. Which HDFS operation allows you to view the contents of a file?
A) hadoop fs -ls
B) hadoop fs -cat
C) hadoop fs -rm
D) hadoop fs -mkdir
Answer: B
Explanation: The hadoop fs -cat command displays the contents of files stored in HDFS.

Question 6. How do you check disk usage for a directory in HDFS?
A) hadoop fs -du
B) hadoop fs -ls
C) hadoop fs -chmod
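
Worked example for Question 4. The sketch below shows how an octal mode such as 755 maps to the symbolic form rwxr-xr-x: each digit is a three-bit mask (4 = read, 2 = write, 1 = execute) for owner, group, and others in turn. The helper name mode_to_symbolic is hypothetical and is not part of the Hadoop CLI; it only illustrates the arithmetic behind the answer.

```shell
#!/bin/sh
# mode_to_symbolic: hypothetical helper, not a Hadoop command.
# Converts an octal permission string (e.g. 755) to symbolic form.
mode_to_symbolic() {
  mode=$1
  out=""
  while [ -n "$mode" ]; do
    rest=${mode#?}          # everything after the first character
    digit=${mode%"$rest"}   # the first character
    mode=$rest
    case $digit in
      0) out="${out}---" ;;
      1) out="${out}--x" ;;
      2) out="${out}-w-" ;;
      3) out="${out}-wx" ;;
      4) out="${out}r--" ;;
      5) out="${out}r-x" ;;
      6) out="${out}rw-" ;;
      7) out="${out}rwx" ;;
      *) echo "invalid octal digit: $digit" >&2; return 1 ;;
    esac
  done
  printf '%s\n' "$out"
}

mode_to_symbolic 755   # prints rwxr-xr-x
mode_to_symbolic 644   # prints rw-r--r--
```
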
D) hadoop fs -cat
Answer: A
Explanation: hadoop fs -mkdir creates directories in HDFS.

Question 10. What does the hadoop fs -cp command do?
A) Copies files within HDFS
B) Copies files from local to HDFS
C) Changes file permissions
D) Deletes files
Answer: A
Explanation: hadoop fs -cp copies files and directories within the HDFS filesystem.

Question 11. How can you rename a file in HDFS?
A) hadoop fs -mv
B) hadoop fs -rm
C) hadoop fs -ls
D) hadoop fs -get
Answer: A
Explanation: hadoop fs -mv moves or renames files in HDFS.

Question 12. Which command displays the replication factor of a file in HDFS?
A) hadoop fs -stat %r
B) hadoop fs -du
C) hadoop fs -ls
D) hadoop fs -chmod
Answer: A
Explanation: hadoop fs -stat %r shows the replication factor of the specified file.

Question 13. What is the purpose of the hadoop fs -copyToLocal command?
A) Copies files from HDFS to local file system
B) Copies files from local to HDFS
C) Renames files
D) Deletes files
Answer: A
Explanation: hadoop fs -copyToLocal copies files from HDFS to the local file system.

Question 14. Which command in HDFS is used to recursively list all files in a directory?
A) hadoop fs -ls -R
B) hadoop fs -cat
C) hadoop fs -mkdir
D) hadoop fs -rm -r
Answer: A
Explanation: hadoop fs -ls -R lists all files recursively in a directory.

Question 15. How do you check the quota for a directory in HDFS?
A) hadoop fs -count -q
B) hadoop fs -du
C) hadoop fs -ls
D) hadoop fs -mkdir
Answer: A
Explanation: hadoop fs -chown changes the owner and group of files in HDFS.

Question 19. Which command shows file size in human-readable format in HDFS?
A) hadoop fs -ls -h
B) hadoop fs -du
C) hadoop fs -cat
D) hadoop fs -mv
Answer: A
Explanation: hadoop fs -ls -h lists file sizes in human-readable format.

Question 20. What does the hadoop fs -test -e command do?
A) Tests if a file exists in HDFS
B) Tests for file permissions
C) Tests for file replication
D) Tests for file size
Answer: A
Explanation: hadoop fs -test -e checks for the existence of a file in HDFS.

Question 21. In Hadoop, what is a DataNode responsible for?
A) Storing and managing HDFS blocks
B) Scheduling jobs
C) Managing cluster resources
D) Monitoring applications
Answer: A
Explanation: DataNodes store and manage blocks of data in HDFS.

Question 22. What is the function of NameNode in HDFS?
A) Maintains metadata and namespace
B) Stores data blocks
C) Manages logs
D) Schedules jobs
Answer: A
Explanation: NameNode maintains file system metadata and namespace information in HDFS.

Question 23. How does HDFS ensure high availability of data?
A) Data replication across multiple nodes
B) Data compression
C) Data encryption
D) Data partitioning
Answer: A
Explanation: HDFS replicates data blocks across multiple DataNodes for high availability.

Question 24. Which command is used to copy files from local file system into HDFS?
A) hadoop fs -put
B) hadoop fs -get
C) hadoop fs -mv
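
Worked example for Questions 2 and 23. As a rough illustration of how block size and replication combine, the sketch below computes how many blocks a file occupies and how many block replicas DataNodes end up storing. The helper names blocks_needed and replicas_stored are hypothetical. The 128 MB block size comes from Question 2; the replication factor of 3 is an assumption (it is the usual default, but clusters can configure it differently).

```shell
#!/bin/sh
# blocks_needed: hypothetical helper, ceiling division of file size by block size.
blocks_needed() {
  file_mb=$1
  block_mb=${2:-128}    # 128 MB default block size (Question 2)
  echo $(( ($file_mb + $block_mb - 1) / $block_mb ))
}

# replicas_stored: hypothetical helper, total block replicas held by DataNodes.
replicas_stored() {
  size_mb=$1
  replication=${2:-3}   # assumption: replication factor 3
  echo $(( $(blocks_needed "$size_mb") * $replication ))
}

blocks_needed 300       # prints 3 (128 + 128 + 44 MB)
replicas_stored 300     # prints 9
```

The last block of a file can be smaller than the block size, which is why the count is a ceiling rather than a plain division.
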
D) Copies files recursively
Answer: A
Explanation: hadoop fs -rm -r recursively deletes directories and their contents in HDFS.

Question 28. Which command is used to copy multiple files to HDFS at once?
A) hadoop fs -put file1 file2 /dest/
B) hadoop fs -get file1 file2 /dest/
C) hadoop fs -mv file1 file2 /dest/
D) hadoop fs -ls file1 file2 /dest/
Answer: A
Explanation: hadoop fs -put allows copying multiple files to HDFS in one command.

Question 29. How can you check the replication factor for all files in a directory?
A) hadoop fs -ls -R /dir/ | awk '{print $2}'
B) hadoop fs -du /dir/
C) hadoop fs -chmod /dir/
D) hadoop fs -rm /dir/
Answer: A
Explanation: hadoop fs -ls -R lists files recursively; awk extracts the replication factor, which is the second column of the listing (the fifth column is the file size).

Question 30. What is the role of the Secondary NameNode in Hadoop?
A) Periodically merges namespace and edit logs
B) Stores data blocks
C) Runs MapReduce jobs
D) Manages cluster resources
Answer: A
Explanation: Secondary NameNode merges namespace and edit logs to assist the primary NameNode.

Question 31. Which HDFS command is used to check file checksum?
A) hadoop fs -checksum
B) hadoop fs -ls
C) hadoop fs -mv
D) hadoop fs -chmod
Answer: A
Explanation: hadoop fs -checksum computes and displays the checksum of a file in HDFS.

Question 32. What does the hadoop fs -stat command provide?
A) File statistics like size, modification time, replication factor
B) Directory listing
C) File permissions
D) File content
Answer: A
Explanation: hadoop fs -stat provides statistics for files in HDFS.

Question 33. How do you display HDFS disk usage including all subdirectories?
A) hadoop fs -du -s -h /dir/
B) hadoop fs -ls /dir/
C) hadoop fs -cp /dir/
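
Worked example for Question 29. The sketch below runs awk over a few sample lines laid out like hadoop fs -ls -R output (permissions, replication, owner, group, size, date, time, path). The sample paths, owners, and sizes are made up for illustration, and sample_listing and replication_report are hypothetical helper names; on a real cluster the listing would come from hadoop fs -ls -R instead.

```shell
#!/bin/sh
# sample_listing: made-up lines in the column layout of `hadoop fs -ls -R`.
# Directories show "-" in the replication column.
sample_listing() {
  cat <<'EOF'
drwxr-xr-x   - hdfs hadoop          0 2024-01-15 10:02 /dir/sub
-rw-r--r--   3 hdfs hadoop    1048576 2024-01-15 10:03 /dir/a.txt
-rw-r--r--   2 hdfs hadoop       2048 2024-01-15 10:04 /dir/sub/b.txt
EOF
}

# replication_report: keep regular files (permissions start with "-")
# and print the replication factor (column 2) next to the path (column 8).
replication_report() {
  awk '$1 ~ /^-/ { print $2, $8 }'
}

sample_listing | replication_report
```

Filtering on the first character skips directory entries, which have no replication factor of their own.
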
D) Changes file permissions
Answer: A
Explanation: -skipTrash bypasses the Trash and deletes files immediately.

Question 37. Which command displays the block locations for a file in HDFS?
A) hdfs fsck /path/to/file -files -blocks -locations
B) hadoop fs -ls
C) hadoop fs -cat
D) hadoop fs -mv
Answer: A
Explanation: hdfs fsck with the -files and -blocks flags reports the blocks of each file; adding -locations also prints the DataNodes that hold each block.

Question 38. How do you set a quota for the number of files and directories in HDFS?
A) hdfs dfsadmin -setQuota
B) hadoop fs -quota
C) hadoop fs -chmod
D) hadoop fs -du
Answer: A
Explanation: hdfs dfsadmin -setQuota sets a name quota, the maximum number of files and directories, on a directory in HDFS.

Question 39. What is the result of hadoop fs -chown user:group /file?
A) Changes owner and group of /file
B) Changes file permissions
C) Displays file content
D) Copies file
Answer: A
Explanation: -chown sets the user and group ownership for a file.

Question 40. Which command is used to check the status of a file or directory in HDFS?
A) hadoop fs -stat
B) hadoop fs -ls
C) hadoop fs -mv
D) hadoop fs -du
Answer: A
Explanation: -stat provides status and statistics about files or directories.

Question 41. What is the function of YARN Resource Manager?
A) Allocates resources and schedules jobs in the cluster
B) Stores data blocks
C) Manages HDFS metadata
D) Runs MapReduce tasks
Answer: A
Explanation: The Resource Manager is responsible for resource allocation and job scheduling in YARN.

Question 42. What does a YARN Node Manager do?
A) Manages containers and monitors resource usage on a node
B) Schedules jobs
C) Maintains HDFS metadata
D) A database table
Answer: A
Explanation: A YARN container is an isolated environment with allocated resources for running tasks.

Question 46. How can you view application logs in YARN?
A) yarn logs -applicationId <app_id>
B) hadoop fs -cat
C) hadoop fs -ls
D) yarn application -list
Answer: A
Explanation: yarn logs -applicationId retrieves logs for a specific YARN application.

Question 47. What is the role of the YARN scheduler?
A) Assigns resources to applications based on policies
B) Stores files in HDFS
C) Monitors disk usage
D) Manages file permissions
Answer: A
Explanation: The YARN scheduler allocates resources to applications according to cluster policies.

Question 48. Which component monitors the health of nodes in a YARN cluster?
A) Node Manager
B) Resource Manager
C) NameNode
D) DataNode
Answer: A
Explanation: Node Managers monitor resource usage and node health.

Question 49. What is a YARN application attempt?
A) A retry of failed application execution
B) A new job submission
C) A file operation
D) A network connection
Answer: A
Explanation: YARN automatically retries failed applications as new attempts.

Question 50. How do you kill a YARN application?
A) yarn application -kill <app_id>
B) hadoop fs -rm
C) yarn node -list
D) hdfs dfsadmin -report
Answer: A
Explanation: yarn application -kill terminates a running YARN application.

Question 51. What information does YARN application tracking URL provide?
A) Application status, logs, and resource usage
B) File permissions
C) HDFS block locations
D) DataNode
Answer: A
Explanation: ApplicationMaster manages application execution from start to finish.

Question 55. How do you monitor container resource usage in YARN?
A) Node Manager logs and Resource Manager UI
B) hadoop fs -cat
C) hdfs dfsadmin -report
D) yarn application -list
Answer: A
Explanation: Resource usage can be monitored via Node Manager logs and the Resource Manager UI.

Question 56. What does the YARN Resource Manager high availability feature provide?
A) Failover and redundancy for resource management
B) Data compression
C) File encryption
D) Disk partitioning
Answer: A
Explanation: High availability ensures Resource Manager redundancy and failover capability.

Question 57. Which command gives the list of YARN nodes?
A) yarn node -list
B) yarn application -list
C) hadoop fs -ls
D) hdfs dfsadmin -report
Answer: A
Explanation: yarn node -list displays all nodes registered in the cluster.

Question 58. What is the purpose of YARN Capacity Scheduler?
A) Allocates resources based on configured capacities for queues
B) Schedules file operations
C) Monitors disk usage
D) Manages file permissions
Answer: A
Explanation: Capacity Scheduler distributes resources among queues according to capacity settings.

Question 59. What is a YARN container launch context?
A) Description of environment and resources needed for container execution
B) File block details
C) Network configuration
D) Data compression settings
Answer: A
Explanation: Launch context specifies resources and environment for container execution.

Question 60. Which YARN command provides the history of completed applications?
A) yarn application -list -appStates FINISHED
B) hadoop fs -ls
C) hdfs dfsadmin -report
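
Worked example for Question 58. To make the idea of per-queue capacities concrete, the sketch below computes the share of cluster memory a queue is guaranteed from its configured percentage. The helper queue_share_mb, the 102400 MB cluster size, and the 60/40 split are all hypothetical, and the arithmetic assumes whole-number percentages; it illustrates only the proportional split, not the scheduler's actual allocation logic.

```shell
#!/bin/sh
# queue_share_mb: hypothetical helper, guaranteed memory for one queue.
# Assumes whole-number capacity percentages.
queue_share_mb() {
  total_mb=$1
  capacity_pct=$2
  echo $(( $total_mb * $capacity_pct / 100 ))
}

queue_share_mb 102400 60   # prints 61440
queue_share_mb 102400 40   # prints 40960
```
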