

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
HDFS commands for list files, read/write files, upload/download files, file management, filesystem, administration etc
Typology: Cheat Sheet
1 / 2
This page cannot be seen from the preview
Don't miss anything!


hdfs dfs - ls / List all the files/directories for the given hdfs destination path. hdfs dfs - ls - d /hadoop Directories are listed as plain files. In this case, this command will list the details of hadoop folder. hdfs dfs - ls - h /data Format file sizes in a human-readable fashion (eg 64.0m instead of 67108864). hdfs dfs - ls - R /hadoop Recursively list all files in hadoop directory and all subdirectories in hadoop directory. hdfs dfs - ls /hadoop/dat* List all the files matching the pattern. In this case, it will list all the files inside hadoop directory which starts with 'dat'.
hdfs dfs - text /hadoop/derby.log HDFS Command that takes a source file and outputs the file in text format on the terminal. The allowed formats are zip and TextRecordInputStream. hdfs dfs - cat /hadoop/test This^ command will display the content of the HDFS file test on your stdout. hdfs dfs - appendToFile /home/ubuntu/test /hadoop/text Appends the content of a local file test1 to a hdfs file test2.
hdfs dfs - put /home/ubuntu/sample /hadoop Copies the file from local file system to HDFS. hdfs dfs - put - f /home/ubuntu/sample /hadoop Copies the file from local file system to HDFS, and in case the local already exits in the given destination path, using - f option with put command will overwrite it. hdfs dfs - put - l /home/ubuntu/sample /hadoop Copies the file from local file system to HDFS. Allow DataNode to lazily persist the file to disk. Forces replication factor of 1. hdfs dfs - put - p /home/ubuntu/sample /hadoop Copies the file from local file system to HDFS. Passing - p preserves access and modification times, ownership and the mode. hdfs dfs - get /newfile /home/ubuntu/ Copies the file from HDFS to local file system. hdfs dfs - get - p /newfile /home/ubuntu/ Copies the file from HDFS to local file system. Passing - p preserves access and modification times, ownership and the mode. hdfs dfs - get /hadoop/*.txt /home/ubuntu/ Copies all the files matching the pattern from local file system to HDFS. hdfs dfs - copyFromLocal /home/ubuntu/sample /hadoop Works similarly to the^ put^ command, except that the source is restricted to a local file reference. hdfs dfs - copyToLocal /newfile /home/ubuntu/ Works similarly to the^ put^ command, except that the destination is restricted to a local file reference. hdfs dfs - moveFromLocal /home/ubuntu/sample /hadoop Works similarly to the put command, except that the source is deleted after it's copied.
hdfs dfs - cp /hadoop/file1 /hadoop Copies file from source to destination on HDFS. In this case, copying file1 from hadoop directory to hadoop1 directory. hdfs dfs - cp - p /hadoop/file1 /hadoop Copies file from source to destination on HDFS. Passing - p preserves access and modification times, ownership and the mode. hdfs dfs - cp - f /hadoop/file1 /hadoop1 Copies^ file from source to destination on HDFS. Passing^ - f overwrites the destination if it already exists. hdfs dfs - mv /hadoop/file1 /hadoop Move files that match the specified file pattern
hdfs dfs - rm - r /hadoop hdfs dfs - rm - R /hadoop hdfs dfs - rmr /hadoop Deletes the directory and any content under it recursively. hdfs dfs - rm - skipTrash /hadoop The^ - skipTrash option will bypass trash, if enabled, and delete the specified file(s) immediately. hdfs dfs - rm - f /hadoop If the file does not exist, do not display a diagnostic message or modify the exit status to reflect an error. hdfs dfs - rmdir /hadoop1 Delete a directory. hdfs dfs - mkdir /hadoop2 Create a directory in specified HDFS location. hdfs dfs - mkdir - f /hadoop Create a directory in specified HDFS location. This command does not fail even if the directory already exists. hdfs dfs - touchz /hadoop Creates a file of zero length at