BLOG – Page 12 – Hadoop In Real World


August 9, 2015

Changing Number Of Mappers

Changing Number Of Mappers Number of mappers always equals to the Number of splits. Having said that it is possible to control the number of splits […]
August 4, 2015

InputSplit vs Block

InputSplit vs Block The central idea behind MapReduce is distributed processing and hence the most important thing is to divide the dataset in to chunks and […]
August 1, 2015

HDFS Block Placement Policy

HDFS Block Placement Policy When a file is uploaded in to HDFS it will be divided in to blocks. HDFS will have to decide where to […]
July 28, 2015

Data Locality in Hadoop

Data Locality in Hadoop Data Locality in Hadoop refers to the “proximity” of the data with respect to the Mapper tasks working on the data. Why […]