When uploading a file by the
hdfs dfs -put command line in HDFS, how to set a replication factor instead of the global one for that file?
For example, HDFS’s global replication factor is 3. For some temporary files, I would like to save just one copy for faster uploading and saving disk space.
The replication factor of files to be put by
hdfs dfs -put is from the property
dfs.replication from hdfs-site.xml.
hdfs command allows you to overwrite the properties by the
Hence, to save a file by only one replica, you can use the command as follows.
hdfs dfs -Ddfs.replication=1 -put /path/to/local/file /path/to/hdfs/dir