BLOG – Page 4 – Hadoop In Real World

BLOG

February 2, 2017

Finding the MAX tuple with Pig

Finding the MAX tuple with Pig Here is a sample dataset. Our goal is to find the record with maximum record_value which is [crayon-5a37d9f4707e8505393247-i/]  [crayon-5a37d9f4707f7032469063/] Script […]
January 30, 2017

How to find directories in HDFS which are older than N days?

Cleaning up older or obsolete files in HDFS is important. Even if you have […]
January 26, 2017

How to use multi character delimiter in a Hive table?

Sometimes your data is too complex to delimit the individual columns with a single character like […]
January 23, 2017

Change field termination value in Hive

This blog post describes how to change the field termination value in Hive. Assume when you created the Hive table, […]
January 19, 2017

DataNode process killed due to Incompatible clusterIDs error

This blog post describes how to address the Incompatible clusterIDs error with DataNodes. The problem could be […]
January 16, 2017

FSNamesystem initialization failed

FSNamesystem initialization failed is a common error Hadoop users get, especially if they are trying to set up a Hadoop cluster for […]