MapReduce Tutorial


MapReduce is a parallel programming model for processing large amounts of data. It is the data processing layer of Hadoop: a software framework for writing applications that process vast amounts of structured and unstructured data stored in HDFS, in parallel, on large clusters of hardware. A typical use is deriving structured data from unstructured input.
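
To make the model concrete, here is a minimal sketch of the map, shuffle, and reduce phases applied to word counting. It is plain Java running in memory on a single machine, not code written against the Hadoop API; the class name WordCountSketch and its method names are illustrative assumptions, and a real Hadoop job would read its input from HDFS and run these phases across a cluster.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, in-memory illustration of the MapReduce phases (no Hadoop dependency).
public class WordCountSketch {

    // Map phase: turn one line of unstructured text into (word, 1) pairs.
    public static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!token.isEmpty()) {
                pairs.add(Map.entry(token, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group every emitted value under its key.
    public static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: collapse the values of one key into a single count.
    public static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    // Driver: run map over every line, shuffle the pairs, then reduce each group.
    public static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> mapped = new ArrayList<>();
        for (String line : lines) {
            mapped.addAll(map(line));
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(mapped).entrySet()) {
            result.put(group.getKey(), reduce(group.getKey(), group.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
                "Hadoop stores data in HDFS",
                "MapReduce processes data in parallel");
        System.out.println(wordCount(lines));
    }
}
```

The output is a structured word-to-count table built from unstructured text. In a real cluster, each map call could run on a different node against a different block of the input, which is where the parallelism comes from.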


This tutorial is intended for Hadoop developers who have at least basic knowledge of Hadoop and Java.

Audience & Prerequisites:

The tutorial can be understood by anyone, as all topics are covered in depth.

Experience with Java and Hadoop, together with a computing background, will make the concepts easier to understand. Readers without a computing background may need to go through a topic more than once to understand it clearly.

Table of contents:

  1. MapReduce Tutorial