Diagnosing and Repairing Corrupt HDFS Blocks
When you see output like this from hdfs dfsadmin -report, you need to understand what each metric means and how to respond:
Under replicated blocks: 139016
Blocks with corrupt replicas: 9
Missing blocks: 0
Understanding block states in HDFS is essential for cluster health. The distinction between these categories determines how urgent your response needs to be.
Block Categories and What They Mean
Corrupt replica blocks contain at least one replica that failed checksum validation, but the block also has at least one healthy replica available. Data isn’t currently lost, but redundancy is reduced—if another replica fails, you lose data. These blocks should trigger investigation but aren’t emergencies.
Missing blocks occur when none of a block’s replicas are accessible or valid. All replicas are either unavailable (DataNodes down), corrupt, or deleted. Missing blocks mean data is currently inaccessible and require immediate action.
Under-replicated blocks have fewer healthy replicas than the configured replication factor. HDFS automatically re-replicates these over time, but if the process stalls, it indicates underlying issues.
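As a quick triage aid, these three counters can be checked mechanically. A minimal sketch, using the sample numbers from the report above in place of a live `hdfs dfsadmin -report` call:

```shell
#!/bin/sh
# Triage sketch: decide urgency from the dfsadmin -report counters.
# In production, capture the live report instead of this sample:
#   report=$(hdfs dfsadmin -report)
report='Under replicated blocks: 139016
Blocks with corrupt replicas: 9
Missing blocks: 0'

missing=$(printf '%s\n' "$report" | awk -F': ' '/^Missing blocks/ {print $2}')
corrupt=$(printf '%s\n' "$report" | awk -F': ' '/^Blocks with corrupt replicas/ {print $2}')

if [ "$missing" -gt 0 ]; then
    verdict="CRITICAL: $missing missing blocks, data is inaccessible"
elif [ "$corrupt" -gt 0 ]; then
    verdict="WARNING: $corrupt blocks with corrupt replicas, redundancy reduced"
else
    verdict="OK: no corrupt or missing blocks"
fi
echo "$verdict"
```

With the sample counters this prints the WARNING line, matching the "investigate but not an emergency" guidance above.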
Identifying Affected Files
The first step is always to determine which files are affected. Use hdfs fsck to scan the cluster:
hdfs fsck / -files -blocks -locations
For a summary of problem blocks:
hdfs fsck / -list-corruptfileblocks
To focus on a specific directory:
hdfs fsck /user/hive/warehouse -files -blocks
The output shows block IDs, replica locations, and which replicas are corrupt. Note the block pool IDs and DataNode addresses; you'll need these during diagnosis and recovery.
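To feed the affected paths into follow-up commands, it helps to strip the listing down to file names. A sketch, assuming the common block-ID-then-path line format (verify against your Hadoop version); the canned text stands in for the live fsck call:

```shell
#!/bin/sh
# Sketch: pull just the affected file paths out of -list-corruptfileblocks
# output. The "blk_<id><TAB>/path" line format is an assumption.
# In production pipe the live command instead:
#   hdfs fsck / -list-corruptfileblocks | awk -F'\t' '/^blk_/ {print $2}'
sample=$(printf 'The list of corrupt files under path / are:\nblk_1073741825\t/user/hive/warehouse/sales/part-00000\nblk_1073741999\t/user/data/events/part-00003\n')
paths=$(printf '%s\n' "$sample" | awk -F'\t' '/^blk_/ {print $2}' | sort -u)
echo "$paths"
```

The resulting list can be passed to backup lookups or removal commands later in this article.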
Root Causes and Diagnostics
Hardware or filesystem failures on DataNodes are the most common cause. Check DataNode logs:
tail -f /var/log/hadoop/hdfs/datanode.log
Look for I/O errors, disk full warnings, or checksum failures. Run dmesg on affected DataNodes to catch kernel-level issues.
Offline DataNodes holding replicas. If a DataNode was down during maintenance or crashed, its blocks become unavailable. Check DataNode status:
hdfs dfsadmin -report | grep -A 10 "Dead datanodes"
Bringing the DataNode back online often resolves missing blocks automatically as the NameNode re-replicates.
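To know which hosts to log into, you can pull the dead nodes' addresses out of the report. A sketch under the assumption that the report uses the "Dead datanodes (N):" / "Name: ..." layout of recent Hadoop releases; verify against your version:

```shell
#!/bin/sh
# Sketch: extract dead DataNode addresses from dfsadmin -report so you
# can ssh in and check logs or dmesg. The section layout is an
# assumption. In production: report=$(hdfs dfsadmin -report)
report='Live datanodes (2):
Name: 10.1.2.1:9866 (dn01.example.com)
Name: 10.1.2.2:9866 (dn02.example.com)
Dead datanodes (1):
Name: 10.1.2.7:9866 (dn07.example.com)'
dead=$(printf '%s\n' "$report" | awk '/^Dead datanodes/ {d=1; next} d && /^Name:/ {print $2}')
echo "$dead"
```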
Single replica files. Files with replication factor 1 cannot recover from a lost replica. Once the block is gone, it’s gone. Identify these:
hdfs fsck / -files -blocks | grep "repl=1"
Older fsck versions print "repl=" on block lines; newer releases print "Live_repl=" instead, so check the output format first.
Network issues causing false corruption detection. Transient network failures during block transfers can trigger false corruption flags. Monitor DataNode-to-DataNode communication.
Recovery Procedures
For corrupt replica blocks with healthy copies:
Delete the corrupt file and restore from backup if available:
hdfs dfs -rm /path/to/corrupt/file
Alternatively, wait for HDFS to purge the corrupt replicas automatically: once healthy copies have been re-replicated, the NameNode schedules the corrupt ones for deletion. Detection timing is tied to the DataNode block report interval (dfs.blockreport.intervalMsec, six hours by default), so this usually resolves within hours.
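If backups exist, the delete-and-restore step can be applied to every affected file at once. A cautious sketch with a dry-run default; the stubbed fsck output and its block-ID-then-path format are assumptions to verify locally:

```shell
#!/bin/sh
# Sketch: batch delete of files with corrupt blocks, assuming good
# backups exist. DRY_RUN=1 (the default here) only prints what would be
# removed; set DRY_RUN=0 to actually delete.
DRY_RUN=${DRY_RUN:-1}
# In production replace this stub with the live command:
#   hdfs fsck / -list-corruptfileblocks
list_corrupt() {
    printf 'blk_1073741825\t/tmp/demo/corrupt-a\nblk_1073741999\t/tmp/demo/corrupt-b\n'
}
removed=""
for path in $(list_corrupt | awk -F'\t' '/^blk_/ {print $2}' | sort -u); do
    if [ "$DRY_RUN" -eq 1 ]; then
        echo "would remove: $path"
    else
        hdfs dfs -rm "$path"    # then restore the file from backup
    fi
    removed="$removed $path"
done
```

Run it once in dry-run mode, compare the list against your backups, and only then rerun with DRY_RUN=0.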
For missing blocks from active files:
First, bring any offline DataNodes back online:
systemctl start hadoop-hdfs-datanode
Wait 10-15 minutes for the DataNode to heartbeat and re-register its blocks. Check progress:
hdfs fsck / -list-corruptfileblocks
If the DataNode cannot come back online, the data is likely unrecoverable. Delete the file:
hdfs dfs -rm -r /path/to/file
And restore from a backup.
For files with replication factor 1:
These have no redundancy. Increase the replication factor immediately:
hdfs dfs -setrep -w 3 /path/to/file
The -w flag waits for replication to complete. Once replication finishes, the file can again tolerate the loss of a replica.
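The search for single-replica files can be combined with the fix in one pass. A sketch, assuming the older "repl=" fsck output format (newer releases print "Live_repl=" instead); the canned output stands in for a live fsck run:

```shell
#!/bin/sh
# Sketch: list files whose blocks have only a single replica.
# In production: fsck_out=$(hdfs fsck /user -files -blocks)
fsck_out='/user/data/a.txt 1048576 bytes, 1 block(s):  OK
0. BP-1:blk_1073741825_1001 len=1048576 repl=1
/user/data/b.txt 2097152 bytes, 1 block(s):  OK
0. BP-1:blk_1073741826_1002 len=2097152 repl=3'
single=$(printf '%s\n' "$fsck_out" | awk '
    / bytes, / {file=$1}      # a file header line: remember the path
    /repl=1$/  {print file}   # a block line with exactly one replica
' | sort -u)
echo "$single"
# For each path found:  hdfs dfs -setrep -w 3 "$path"
```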
Prevention and Best Practices
Set the default replication factor to 3 for any critical data. Note that dfs.replication is a client-side default and only applies to files written after the change; use hdfs dfs -setrep to fix existing files. Update hdfs-site.xml:
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
Monitor proactively. Check hdfs fsck output weekly and set up alerts on DataNode logs:
hdfs fsck / -list-corruptfileblocks | mail -s "HDFS Corruption Alert" admin@example.com
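The one-liner above sends mail on every run, corruption or not. A sketch of a cron-friendly variant that alerts only when fsck actually lists corrupt blocks; the mail command and address are placeholders for whatever alerting you use, and the "blk_" line format is an assumption to verify against your fsck output:

```shell
#!/bin/sh
# Sketch: conditional alerting on fsck output.
alert_if_corrupt() {
    if printf '%s\n' "$1" | grep -q '^blk_'; then
        echo "ALERT"
        # printf '%s\n' "$1" | mail -s "HDFS Corruption Alert" admin@example.com
    else
        echo "clean"
    fi
}
# From cron: alert_if_corrupt "$(hdfs fsck / -list-corruptfileblocks)"
alert_if_corrupt 'The filesystem under path / has 0 CORRUPT files'
alert_if_corrupt "$(printf 'blk_1073741825\t/user/data/broken')"
```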
Maintain healthy DataNode disk space. Corrupt blocks often result from full or nearly-full disks. Keep at least 10% free:
hdfs dfsadmin -report | grep "Configured Capacity"
Replace failing hardware immediately. Don’t let degraded nodes sit in your cluster. If a DataNode is showing signs of disk failure, decommission it and replace the hardware.
Run regular backups. No amount of replication replaces actual backups. Use distcp or external tools to snapshot critical datasets.
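A minimal sketch of such a backup job; the backup cluster URI, the dataset path, and the distcp options shown are illustrative assumptions, not a prescribed setup:

```shell
#!/bin/sh
# Sketch of a nightly distcp backup of a critical dataset to a second
# cluster. backupnn and the paths are placeholders; -update copies only
# changed files and -p preserves file attributes.
src=/user/hive/warehouse/critical_db
dest="hdfs://backupnn:8020/backups/critical_db/$(date +%F)"
echo "backing up $src -> $dest"
# hadoop distcp -update -p "$src" "$dest"
```

Dating the destination directory keeps independent snapshots, so a corruption that slips into one backup does not overwrite the previous good copy.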
When to Escalate
If missing or corrupt blocks persist after bringing DataNodes online, involve your storage team. For a consistent view of the namespace you can enter safe mode before running fsck, but be warned that safe mode blocks all writes cluster-wide:
hdfs dfsadmin -safemode enter
hdfs fsck / -list-corruptfileblocks
hdfs dfsadmin -safemode leave
Treat this as a last resort for diagnosing a handful of stubborn files, not as a routine check.

Comments
Hi. I believe there’s a mistake in the description of corrupt blocks: “A block is called corrupt by HDFS if it has at least one corrupt replica.” I think that all replicas must be corrupted for a block to be marked as corrupt, rather than at least one.
JIRA HDFS-7281 explains the following:
1. A block is missing if and only if all DNs of its expected replicas are dead.
2. A block is corrupt if and only if all of its available replicas are corrupt. So if a block has 3 replicas, one DataNode is dead, and the other two replicas are corrupt, the block is marked as corrupt.
Hi Jim, yes, the description was not accurate. I’ve fixed it to make it clearer: “A block is ‘with corrupt replicas’ in HDFS if it has at least one corrupt replica along with at least one live replica.” This refers to the count in the hdfs dfsadmin -report output.