BLOG – Page 12 – Hadoop In Real World

BLOG

June 5, 2015
The power of Big Data

The Power Of Big Data

One of my close friends recently joined Microsoft in Seattle in their highly acclaimed data analysis team. I asked him what was his first assignment. He […]
October 22, 2014
How to prepare for Hadoop interview - part 1

Preparing for Hadoop Interview

3 years ago only a small number of companies were using Hadoop. Now Hadoop technology has grown leaps and bounds so as its user base. Companies […]
September 24, 2014

How do you debug a performance issue or a long running job in Hadoop?

This post will explain how can you approach the above question when asked in an interview. This is an open ended interview question and the interviewer […]
June 18, 2014
Hadoop Tool Runner

Explaining ToolRunner

This post explains the class relationship when we use ToolRunner to run a MapReduce job. It is not really complicated but we use the below pictorial […]
May 11, 2014
MapReduce - MRUnit

MRUnit To Test MapReduce

This post explains how to unit test a MapReduce program using MRUnit. Apache MRUnit ™ is a Java library that helps developers unit test Apache Hadoop map […]
April 19, 2014
Million Song Dataset

Using Million Song Dataset In Hadoop

What is Million Song Dataset ?    The Million Song Dataset is a freely-available collection of audio features and metadata for a million   contemporary popular music tracks. The […]