PUMA: Benchmarking MapReduce Performance
MapReduce remains a foundational programming model for processing large-scale distributed datasets, though its role has evolved alongside modern data processing frameworks. Hadoop implementations and other MapReduce systems require objective performance evaluation to guide infrastructure decisions and optimization efforts. PUMA (Princeton University MapReduce Benchmark) is a comprehensive benchmark suite developed by Faraz Ahmad and colleagues to…