Lazaro Martull

Clean Water & Sanitation Big Data Analysis (Hadoop & MapReduce)

Overview

This project applies Big Data processing techniques to analyze global trends in clean water access, in line with UN Sustainable Development Goal 6 (Clean Water & Sanitation).

Large-scale water-access data was processed with MapReduce jobs run through Hadoop Streaming to compute regional and temporal (yearly) trends. The results were then visualized and interpreted in Python.
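As a rough illustration of the regional aggregation described above, here is a minimal sketch of a Hadoop Streaming style mapper and reducer in Python. It is not the project's actual code: the CSV column layout (region, year, access percentage), the choice of a per-region average, and the function names `mapper` and `reducer` are all assumptions made for the example.

```python
import csv
from itertools import groupby


def mapper(lines):
    """Emit 'region<TAB>value' for each usable CSV row.

    Assumed (hypothetical) column order: region, year, access_pct.
    """
    for row in csv.reader(lines):
        if len(row) < 3:
            continue  # blank or truncated row
        region, access = row[0].strip(), row[2].strip()
        try:
            value = float(access)
        except ValueError:
            continue  # header line or non-numeric value
        yield f"{region}\t{value}"


def reducer(lines):
    """Average the values for each key.

    Input must already be sorted by key, which is what Hadoop Streaming
    provides to a reducer after the shuffle phase.
    """
    pairs = (
        line.rstrip("\n").split("\t", 1) for line in lines if "\t" in line
    )
    for region, group in groupby(pairs, key=lambda kv: kv[0]):
        values = [float(v) for _, v in group]
        yield f"{region}\t{sum(values) / len(values):.2f}"


def run_locally(lines):
    """Simulate map -> sort -> reduce in one process for quick checks."""
    return list(reducer(sorted(mapper(lines))))
```

In a real streaming job each function would be wrapped in a small script that reads `sys.stdin` and prints one record per line. Keying on the year column instead of the region would give the yearly variant of the same pattern.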

Problem & Goal

The goal was to analyze large water-access datasets efficiently and answer questions such as:

What I Built

Big Data Pipeline

Tools & Technologies

Key Takeaways

Files

📄 Final Report: report.pdf

🧠 MapReduce – Regional Analysis:

🧠 MapReduce – Yearly Analysis:

📊 Analysis & Visuals: