Checking HDFS File Replication Factor
When managing HDFS clusters, you often need to verify the replication factor of specific files to ensure data redundancy meets your requirements. Here are the practical methods to check this.
Using hdfs dfs -ls
The most straightforward way is to list the file with hdfs dfs -ls:
hdfs dfs -ls /usr/GroupStorage/data1/out.txt
Output:
-rw-r--r-- 3 hadoop zma 11906625598 2014-10-22 18:35 /usr/GroupStorage/data1/out.txt
The second column (the number 3 in this example) is the replication factor: this file has 3 replicas across the cluster. For directories, this column shows a dash, since replication applies only to files.
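Because the replication factor occupies a fixed field, it is easy to pull out with awk. A minimal sketch, using the sample listing line above as stand-in data (on a real cluster you would pipe hdfs dfs -ls into the awk filter instead):

```shell
# Pull the replication factor (second field) out of hdfs dfs -ls output.
# The sample line from above serves as stand-in data here.
line='-rw-r--r-- 3 hadoop zma 11906625598 2014-10-22 18:35 /usr/GroupStorage/data1/out.txt'
rep=$(printf '%s\n' "$line" | awk '$1 ~ /^-/ { print $2 }')
echo "$rep"   # prints 3 for this sample line
```

The `$1 ~ /^-/` condition keeps only regular files, since directory entries begin with `d` and carry no replication factor.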
Using hdfs dfs -stat
For a cleaner, more parseable output, use the -stat flag with the %r format specifier:
hdfs dfs -stat %r /usr/GroupStorage/data1/out.txt
Output:
3
This is particularly useful when scripting or piping output to other tools, as it returns only the replication factor without additional formatting.
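For example, a script can compare the reported factor against a required minimum. A minimal sketch; the check_rep helper and the threshold are illustrative, not part of the HDFS CLI, and the live hdfs call is shown commented out since it needs a running cluster:

```shell
# Minimal replication check for use in scripts.
check_rep() {
    # $1 = observed replication factor, $2 = required minimum
    [ "$1" -ge "$2" ]
}

# On a live cluster (commented out; needs a working HDFS client):
# rep=$(hdfs dfs -stat %r /usr/GroupStorage/data1/out.txt)
# check_rep "$rep" 3 || echo "under-replicated: out.txt"

check_rep 3 3 && echo "replication OK"
```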
Checking Multiple Files
To check the replication factor for all files in a directory, include the filename in the format string so you can tell the results apart:
hdfs dfs -stat "%n %r" /usr/GroupStorage/data1/*
Note that shell globbing is not recursive: a pattern such as /usr/GroupStorage/data1/*/ expands only one level of subdirectories. For a full recursive view, use hdfs dfs -ls -R and read the replication factor from the second column of the listing.
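The recursive approach can be combined into one pipeline: list recursively, keep only regular files (directories show a dash in the replication column), and print factor plus full path. A sketch using sample listing lines as stand-in data for real `-ls -R` output:

```shell
# Replication factor and full path for every file in a recursive listing.
# Directory entries begin with 'd' and show '-' in the replication column,
# so the filter keeps only regular files.
sample='-rw-r--r-- 3 hadoop zma 11906625598 2014-10-22 18:35 /usr/GroupStorage/data1/out.txt
drwxr-xr-x - hadoop zma 0 2014-10-22 18:40 /usr/GroupStorage/data1/archive'
printf '%s\n' "$sample" | awk '$1 ~ /^-/ { print $2, $NF }'

# On a live cluster:
# hdfs dfs -ls -R /usr/GroupStorage/data1 | awk '$1 ~ /^-/ { print $2, $NF }'
```

Printing `$NF` assumes paths contain no embedded spaces, which is the common case on HDFS.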
Checking Replication on the NameNode
If you need to inspect replication details from the NameNode perspective, use the fsck command:
hdfs fsck /usr/GroupStorage/data1/out.txt
This provides detailed information about block distribution and replication status across the cluster, including which DataNodes hold each block replica. It’s useful for troubleshooting under-replicated or over-replicated files.
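fsck also accepts flags such as -files, -blocks, and -locations for per-block detail, and its summary includes an under-replicated block count that is easy to extract in scripts. A minimal sketch using abridged, illustrative sample report lines as stand-in data (on a live cluster, pipe `hdfs fsck <path>` into the filter instead):

```shell
# Extract the under-replicated block count from an fsck summary.
# The report text below is an abridged, illustrative stand-in.
report='Total blocks (validated): 89 (avg. block size 139950801 B)
Minimally replicated blocks: 89 (100.0 %)
Under-replicated blocks: 2 (2.2 %)'
under=$(printf '%s\n' "$report" | awk '/^Under-replicated blocks/ { print $3 }')
echo "$under"   # prints 2 for this sample report
```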
Common Format Specifiers for -stat
The -stat command supports several useful format specifiers beyond %r:
%b - file size in bytes
%n - filename
%o - block size
%r - replication factor
%u - owner
%g - group
Combine them for custom output:
hdfs dfs -stat "%n has replication %r and block size %o" /usr/GroupStorage/data1/out.txt
Output:
out.txt has replication 3 and block size 134217728
Changing Replication Factor
If you need to modify the replication factor:
hdfs dfs -setrep -w 2 /usr/GroupStorage/data1/out.txt
The -w flag waits for the replication to complete before returning. Without it, the command returns immediately and replication happens asynchronously.
When the target is a directory, setrep applies to all files under it recursively; the -R flag is accepted for backward compatibility and has no additional effect:
hdfs dfs -setrep -w 2 /usr/GroupStorage/data1/
Quick Reference
hdfs dfs -ls <path> - replication factor in the second column of the listing
hdfs dfs -stat %r <path> - replication factor only, convenient for scripts
hdfs fsck <path> - block-level replication and placement details
hdfs dfs -setrep -w <n> <path> - change the replication factor
Practice in a test environment before making changes on production systems.
Hadoop Cluster Health Monitoring
Regular health checks prevent small issues from becoming cluster-wide problems. Monitor HDFS capacity utilization, ensure DataNode heartbeats are current, and watch for under-replicated blocks, which indicate a potential data-loss risk.
Key monitoring commands include hdfs dfsadmin -report for cluster overview, yarn node -list for NodeManager status, and hdfs fsck / for filesystem consistency checks. Set up automated alerts for critical metrics like disk usage above 85% or failed NodeManagers.
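As a sketch of such an alert, the DFS Used% line of hdfs dfsadmin -report can be extracted and compared against the threshold. The report text and values below are illustrative stand-ins; on a live cluster, capture the real command output instead:

```shell
# Sketch of an automated disk-usage alert, with an 85% threshold as above.
# The report text is an illustrative stand-in for `hdfs dfsadmin -report`.
report='Configured Capacity: 1099511627776 (1 TB)
DFS Used%: 87.50%'
used=$(printf '%s\n' "$report" | awk '/^DFS Used%/ { gsub(/%/, "", $3); print $3 }')
# Compare in awk, since the shell cannot compare floating-point values.
if awk -v u="$used" 'BEGIN { exit !(u > 85) }'; then
    echo "ALERT: DFS usage ${used}% is above 85%"
fi
```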
Quick Verification
After changing a replication factor, confirm the new value with hdfs dfs -stat %r on the path, then run hdfs fsck on it to verify that no blocks remain under- or over-replicated. If the reported value does not match what you set, check the NameNode logs for errors and consult the documentation for your Hadoop version.

Hello Eric,
I want to find all files with a replication factor of 1 and change it to 3.
I am unable to get the complete path of these files and directories, so I cannot change it. Is there a way to get a list (including the complete path) of all files with RF 1, so that I can change their replication to 3?
Regards
Wert.
You can find the files with a replication factor of 1 using the method introduced at https://www.systutorials.com/how-to-find-out-all-files-with-replication-factor-1-in-hdfs/ , then set the new factor on them. A script can automate the process.
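Building on the reply above, one possible sketch of such a script: filter a recursive listing for files whose replication column equals 1, then feed the paths to setrep. The sample lines stand in for real `hdfs dfs -ls -R /` output, and the live pipeline is shown commented out (it assumes paths without embedded spaces):

```shell
# Sketch: list files with replication factor 1, then raise them to 3.
# Sample lines stand in for real `hdfs dfs -ls -R /` output; the filter
# keeps regular files (permissions start with -) whose second column is 1.
sample='-rw-r--r-- 1 hadoop zma 1024 2014-10-22 18:35 /data/a.txt
-rw-r--r-- 3 hadoop zma 2048 2014-10-22 18:35 /data/b.txt'
printf '%s\n' "$sample" | awk '$1 ~ /^-/ && $2 == 1 { print $NF }'

# On a live cluster:
# hdfs dfs -ls -R / | awk '$1 ~ /^-/ && $2 == 1 { print $NF }' \
#   | xargs -n 100 hdfs dfs -setrep 3
```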