BLOG – Hadoop In Real World


October 18, 2017

Q&A Session – Spark, Flink, Cluster Sizing and more

We hosted a webinar Saturday, October 14th 2017 and we answered some great questions that was posted by Hadoop In Real World community and also from […]
October 11, 2017

What is RDD?

We see time and time again, folks who are trying to understand RDD, ask questions about RDD online in websites like stackoverflow or other tech. forums […]
October 4, 2017

RCFile vs. ORC

We hosted a webinar Saturday, September 30th 2017 and the topic that was covered was RCFile vs. ORC. We had over 60 participants in the webinar. […]
September 27, 2017
AWS home screen

Creating EC2 Instances in AWS to Launch a Hadoop Cluster

This is one of the most widely requested post from our community. We have seen over and over again in other courses where they would start […]
September 20, 2017
Apache Ambari Dashboard

Installing and Configuring a Hadoop Cluster with Apache Ambari

Apache Ambari is an open source project and its main purpose is to install or deploy, manage and monitor Hadoop clusters. In this post we will […]
September 13, 2017
Spark vs Hadoop - Comparison Chart

Spark vs. Hadoop – Who Wins?

When you first heard about Spark, you probably did a quick google search to find out that Apache Spark runs programs up to 100 times faster […]