Configuring and Tuning Hadoop: Common Pitfalls and Lessons
Hadoop 3.x remains the current stable version with YARN improvements, HDFS federation, and erasure coding. Hadoop 1.x and 2.x are no longer maintained. For new deployments, evaluate whether Hadoop fits your needs or if managed services like AWS EMR, Google Dataproc, or Azure HDInsight better suit your infrastructure.
This post covers common configuration mistakes and lessons learned from deploying and tuning Hadoop clusters.
IPv6 and Network Configuration
Hadoop has historically had poor IPv6 support. While newer versions have improved, IPv6 in production Hadoop clusters still introduces networking complications and potential performance degradation.
Best practice: Disable IPv6 on all cluster nodes unless you have specific requirements and have thoroughly tested your setup.
To disable IPv6:
# Disable IPv6 via sysctl
echo "net.ipv6.conf.all.disable_ipv6 = 1" | sudo tee -a /etc/sysctl.conf
echo "net.ipv6.conf.default.disable_ipv6 = 1" | sudo tee -a /etc/sysctl.conf
sudo sysctl -p
Verify IPv6 is disabled:
cat /proc/sys/net/ipv6/conf/all/disable_ipv6
A value of 1 confirms IPv6 is disabled.
Hostnames vs. IP Addresses
Always use fully qualified domain names (FQDNs) instead of IP addresses in your Hadoop configuration. This prevents numerous subtle issues:
- DNS resolution inconsistencies across nodes
- Service discovery failures
- Security and authentication problems with Kerberos
- Difficulty replacing nodes or changing infrastructure
Configure proper hostname resolution on all nodes before deploying Hadoop:
# Verify hostname is set correctly
hostname -f
# Check /etc/hosts entries
cat /etc/hosts
Every cluster node should have resolvable FQDNs in DNS or /etc/hosts. Example /etc/hosts:
192.168.1.10 namenode.example.com namenode
192.168.1.11 datanode1.example.com datanode1
192.168.1.12 datanode2.example.com datanode2
192.168.1.13 datanode3.example.com datanode3
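A quick way to confirm that each name actually resolves on a node is a small loop over the host list. The hostnames below mirror the /etc/hosts sample above; substitute your own:

```shell
#!/usr/bin/env bash
# Verify that every cluster hostname resolves via /etc/hosts or DNS.
check_host() {
  # Succeeds if getent can resolve the name through the normal NSS lookup.
  getent hosts "$1" > /dev/null
}

for host in namenode.example.com datanode1.example.com \
            datanode2.example.com datanode3.example.com; do
  if check_host "$host"; then
    echo "OK:   $host"
  else
    echo "FAIL: $host does not resolve"
  fi
done
```

Run this on every node; a single node with a stale /etc/hosts entry is enough to cause intermittent failures.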
In core-site.xml, use:
<property>
<name>fs.defaultFS</name>
<value>hdfs://namenode.example.com:8020</value>
</property>
And in hdfs-site.xml, reference nodes by hostname:
<property>
<name>dfs.namenode.rpc-address</name>
<value>namenode.example.com:8020</value>
</property>
Java Virtual Machine Compatibility
Hadoop’s stability depends on consistent JVM versions across all nodes. Mismatched or misconfigured JVMs cause cascading failures that are difficult to diagnose.
Requirements:
- Use the same Java version on every node (OpenJDK 8 or 11 for Hadoop 3.x; check the release notes of your exact release for newer supported runtimes)
- Verify Hadoop’s Java compatibility before deployment
- Set heap size appropriately for your workload
Check Java version on all nodes:
java -version
Configure JVM heap in hadoop-env.sh:
# Hadoop 3.x heap settings (the HADOOP_*_HEAPSIZE variables from 2.x are deprecated)
export HADOOP_HEAPSIZE_MAX=4g
export HDFS_NAMENODE_OPTS="-Xms2g -Xmx2g"
export HDFS_DATANODE_OPTS="-Xms2g -Xmx2g"
Verify Java is installed identically across nodes:
ansible all -m shell -a "java -version"
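If Ansible is not available, a small helper can compare the version strings collected from each node. The ssh fan-out shown in the comments is an assumption; adapt the host list to your cluster:

```shell
#!/usr/bin/env bash
# Succeeds only when all version strings passed to it are identical.
same_java_version() {
  local first="$1" v
  for v in "$@"; do
    [ "$v" = "$first" ] || return 1
  done
}

# Gather one version string per node, then compare (hostnames and ssh
# access are assumptions -- adapt to your cluster):
#   versions=()
#   for h in namenode datanode1 datanode2 datanode3; do
#     versions+=("$(ssh "$h" 'java -version 2>&1 | head -1')")
#   done
#   same_java_version "${versions[@]}" || echo "JVM mismatch detected"

same_java_version "11.0.20" "11.0.20" && echo "versions match"
```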
Mismatched JVMs typically manifest as:
- NameNode crashes with OutOfMemoryError
- DataNode heartbeat failures
- Unexpected task failures in YARN
- Serialization exceptions between nodes
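A rough way to look for these signatures is to scan the daemon logs. This is only a heuristic (the patterns will also match benign heartbeat messages), and the default log directory is an assumption; adjust it if HADOOP_LOG_DIR is set:

```shell
#!/usr/bin/env bash
# Lists log files containing common JVM-mismatch failure signatures.
scan_logs() {
  # $1 = directory to scan; prints matching file names, succeeds either way.
  grep -rEl 'OutOfMemoryError|heartbeat|SerializationException' "$1" 2>/dev/null
  return 0
}

scan_logs "${HADOOP_HOME:-/opt/hadoop}/logs"
```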
Additional Configuration Checkpoints
Network Configuration:
- Ensure network connectivity between all nodes (ping tests)
- Verify no firewall rules block Hadoop ports (in Hadoop 3.x: 8020 for NameNode RPC, 9870 for the NameNode web UI, 9864 and 9866 for DataNodes)
- Configure network timeouts appropriately in core-site.xml
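The ping tests above only confirm ICMP reachability; a TCP-level check catches firewall rules that block specific ports. The sketch below uses bash's built-in /dev/tcp so it needs no netcat; the ports are Hadoop 3.x defaults and the hostname is the example from this post:

```shell
#!/usr/bin/env bash
# TCP port reachability check using bash's /dev/tcp pseudo-device.
check_port() {
  # $1 = host, $2 = port; succeeds if a TCP connection can be opened.
  (exec 3<>"/dev/tcp/$1/$2") 2>/dev/null
}

check_port namenode.example.com 8020 && echo "NameNode RPC reachable" \
                                     || echo "NameNode RPC unreachable"
check_port namenode.example.com 9870 && echo "NameNode web UI reachable" \
                                     || echo "NameNode web UI unreachable"
```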
Storage:
- Use consistent storage paths across all DataNodes
- Ensure DataNode disks have adequate free space (at least 10% headroom)
- Monitor disk I/O performance; slow disks degrade cluster performance significantly
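The 10% headroom rule above can be checked with a short df-based script. The DATA_DIR default below is a placeholder; point it at your dfs.datanode.data.dir path (df --output assumes GNU coreutils):

```shell
#!/usr/bin/env bash
# Warn when a DataNode data disk exceeds 90% usage (i.e. <10% headroom).
DATA_DIR=${DATA_DIR:-/}

has_headroom() {
  # $1 = used percentage as an integer; succeeds if usage is at most 90%.
  [ "$1" -le 90 ]
}

used=$(df --output=pcent "$DATA_DIR" | tail -1 | tr -d ' %')
if has_headroom "$used"; then
  echo "OK: $DATA_DIR at ${used}% used"
else
  echo "WARN: $DATA_DIR above 90% used"
fi
```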
YARN Configuration:
- Match container memory and CPU settings to actual node resources
- Use hadoop-env.sh to set YARN_NODEMANAGER_HEAPSIZE explicitly
- Monitor NodeManager logs for resource allocation failures
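To match container memory to actual node resources, a simple sanity check is to compare the configured yarn.nodemanager.resource.memory-mb value against physical RAM. The 80% cap and the 4096 MB example value below are assumptions (leave room for the OS and Hadoop daemons); the /proc/meminfo read assumes Linux:

```shell
#!/usr/bin/env bash
# Succeeds if the configured container memory fits within 80% of RAM.
fits_in_ram() {
  # $1 = yarn.nodemanager.resource.memory-mb value, $2 = physical RAM (MB)
  [ "$1" -le $(( $2 * 80 / 100 )) ]
}

phys_mb=$(( $(awk '/MemTotal/ {print $2}' /proc/meminfo) / 1024 ))
fits_in_ram 4096 "$phys_mb" && echo "4096 MB fits on this node" \
                            || echo "4096 MB exceeds 80% of RAM"
```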
Debugging cluster issues:
# Check NameNode status
hdfs dfsadmin -report
# Verify all DataNodes are healthy
hdfs dfsadmin -safemode get
# Inspect YARN node status
yarn node -list
# Review daemon logs
tail -f $HADOOP_HOME/logs/hadoop-user-namenode-hostname.log
Consistent, deliberate configuration across all nodes prevents the majority of Hadoop cluster issues.

Finally, verify that /etc/hosts on every node contains no stale or incorrect records; a single bad entry can cause failures that are hard to trace.