BLOG – Page 10 – Hadoop In Real World


August 9, 2015

Changing Number Of Mappers

Changing Number Of Mappers Number of mappers always equals to the Number of splits. Having said that it is possible to control the number of splits […]
August 4, 2015

InputSplit vs Block

The central idea behind MapReduce is distributed processing, and hence the most important thing is to divide the dataset into chunks and […]
August 1, 2015

HDFS Block Placement Policy

When a file is uploaded into HDFS, it is divided into blocks. HDFS will have to decide where to […]
July 28, 2015

Data Locality in Hadoop

Data Locality in Hadoop refers to the “proximity” of the data with respect to the Mapper tasks working on the data. Why […]
July 19, 2015

Hadoop Modes

A Hadoop cluster is made up of several key processes, and each process is designed to do a specific task. Here are the key daemons […]
July 14, 2015

JobTracker and TaskTracker

JobTracker and TaskTracker are two essential processes involved in MapReduce execution in MRv1 (or Hadoop version 1). Both processes are now deprecated in […]