Hadoop Starter Kit - Tutorial - Big Data In Real World

Hadoop Starter Kit – Tutorial



In this Hadoop tutorial, a.k.a. the Hadoop Starter Kit, you will learn the core concepts of Hadoop, such as HDFS and MapReduce, and get an introduction to Apache Pig and Hive. More importantly, you can try what you learn on a 3-node Cloudera CDH5 Hadoop cluster for free.

This course is divided into 4 sections.

Before you read on, please note that this post and all the links and posts below are from our free introductory course on Hadoop, named Hadoop Starter Kit. Click here to enroll in Hadoop Starter Kit. You will also get free access to our 3-node Hadoop cluster hosted on Amazon Web Services (AWS).

Introduction to Big Data

In the very first section, we will see what Big Data is and understand the problems and complexities that come with storing and analyzing it. We will also see how Hadoop provides a solution and addresses those complexities.

What is Big Data?

Understanding Big Data problem

HDFS

In section 2, we will talk about the Hadoop Distributed File System, or HDFS, which is one of the core components of Hadoop. We will start this section by seeing what a file system is and why we need a new file system like HDFS. You will then learn HDFS commands and try them in our training cluster. Click here to get your free access to the cluster. We will finish this section by learning about the HDFS architecture.

HDFS – Why another filesystem?

Working with HDFS

HDFS Architecture

MapReduce

In section 3, we will learn about MapReduce. First we will get a good introduction to MapReduce and then we will go in depth to understand the phases involved in MapReduce. We will then write a MapReduce program in Java to calculate the maximum closing price of stock symbols from a stock dataset. We will go over the MapReduce program in detail.

Introduction to MapReduce

Dissecting MapReduce components

Dissecting MapReduce program (Part 1)

Dissecting MapReduce program (Part 2)
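
To give a feel for the stock example described in this section, here is a minimal sketch of the idea in plain Java. It does not use the Hadoop API and is not the program from the course; it only mimics the map, shuffle and reduce steps in memory. The class name `MaxClosePriceSketch`, the record layout (`symbol,date,close`) and the sample records are all assumptions made for illustration.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java sketch of the map, shuffle and reduce steps (no Hadoop classes).
class MaxClosePriceSketch {

    // Map step: turn one CSV record into a (symbol, closePrice) pair.
    // Assumed (hypothetical) record layout: symbol,date,close
    static Map.Entry<String, Double> map(String line) {
        String[] fields = line.split(",");
        return new SimpleEntry<>(fields[0], Double.parseDouble(fields[2]));
    }

    // Shuffle step: group all emitted values by their key (the stock symbol).
    static Map<String, List<Double>> shuffle(List<Map.Entry<String, Double>> pairs) {
        Map<String, List<Double>> grouped = new TreeMap<>();
        for (Map.Entry<String, Double> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce step: collapse all closing prices of one symbol to their maximum.
    static double reduce(List<Double> closes) {
        double max = Double.NEGATIVE_INFINITY;
        for (double close : closes) {
            max = Math.max(max, close);
        }
        return max;
    }

    // Runs the three steps in order over an in-memory list of records.
    static Map<String, Double> run(List<String> lines) {
        List<Map.Entry<String, Double>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.add(map(line));
        }
        Map<String, Double> result = new TreeMap<>();
        for (Map.Entry<String, List<Double>> entry : shuffle(pairs).entrySet()) {
            result.put(entry.getKey(), reduce(entry.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample records, not taken from the course dataset.
        List<String> sample = Arrays.asList(
            "ABC,2017-02-13,10.50",
            "XYZ,2017-02-13,20.00",
            "ABC,2017-02-14,12.25");
        System.out.println(run(sample));
    }
}
```

In a real Hadoop job the map and reduce steps would run on different nodes and the framework would handle the grouping between them; the sketch keeps everything in one process so the data flow is easy to follow.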

We hope you are excited to start learning Hadoop. As a reminder, this post and the other links in it are from our free introductory course on Hadoop, named Hadoop Starter Kit. Click here to enroll in Hadoop Starter Kit. You will also get free access to our 3-node Hadoop cluster hosted on Amazon Web Services (AWS).

Big Data In Real World
We are a group of Big Data engineers who are passionate about Big Data and related technologies. We have designed, developed, deployed and maintained Big Data applications ranging from batch to real-time streaming platforms. We have seen a wide range of real-world Big Data problems and implemented some innovative and complex (or simple, depending on how you look at it) solutions.
