BLOG – Page 2 – Hadoop In Real World

BLOG

October 4, 2017

RCFile vs. ORC

We hosted a webinar Saturday, September 30th 2017 and the topic that was covered was RCFile vs. ORC. We had over 60 participants in the webinar. […]
September 27, 2017
AWS home screen

Creating EC2 Instances in AWS to Launch a Hadoop Cluster

This is one of the most widely requested post from our community. We have seen over and over again in other courses where they would start […]
September 20, 2017
Apache Ambari Dashboard

Installing and Configuring a Hadoop Cluster with Apache Ambari

Apache Ambari is an open source project and its main purpose is to install or deploy, manage and monitor Hadoop clusters. In this post we will […]
September 13, 2017
Spark vs Hadoop - Comparison Chart

Spark vs. Hadoop – Who Wins?

When you first heard about Spark, you probably did a quick google search to find out that Apache Spark runs programs up to 100 times faster […]
March 2, 2017

Dissecting MapReduce Program (Part 2)

Dissecting MapReduce Program (Part 2) In the last post we went over the driver program of a MapReduce program in detail. We will also see InputFormat, OutputFormat […]
February 27, 2017

Dissecting MapReduce Program (Part 1)

Dissecting MapReduce Program (Part 1) From the previous post, we now we have a very good idea about the phases involved in MapReduce. We have a […]