How to balance DataNode storage in HDFS?

As nodes are added and deleted in a Hadoop cluster. Storage usage across DataNodes may be different. Some DataNodes' disks are almost used up while some others' are almost empty.

How to balance data across DataNodes in HDFS?

asked Oct 22, 2014 by Eric Z Ma (44,280 points)

1 Answer

Best answer

Hadoop provides the balancer to redistribute the data.

Brief introduction to balancer in Hadoop: balancer.

The design and discussion of balancer in Hadoop: HADOOP-1652.

The command to start balancer: hadoop balancer as the administrator.

answered Oct 22, 2014 by Eric Z Ma (44,280 points)

Please log in or register to answer this question.

Copyright © SysTutorials. User contributions licensed under cc-wiki with attribution required.
Hosted on Dreamhost