Apache Hadoop
-
Software Development
Apache Hadoop 2.4.0
The Apache community has voted to release Apache Hadoop 2.4.0, so the new release is now available and consists of…
Read More » -
Software Development
Hadoop MapReduce Concepts
What do you mean by Map-Reduce programming? MapReduce is a programming model designed for processing large volumes of data in…
Read More » -
Enterprise Java
MapReduce Algorithms – Understanding Data Joins Part II
It’s been awhile since I last posted, and like last time I took a big break, I was taking some…
Read More » -
Enterprise Java
Coordination and service discovery with Apache Zookeeper
Service-oriented design has proven to be a successful solution for a huge variety of different distributed systems. When used properly,…
Read More » -
Enterprise Java
Configuring Hadoop with Guava MapSplitters
In this post we are going to provide a new twist on passing configuration parameters to a Hadoop Mapper via…
Read More » -
Enterprise Java
Unit testing a Java Hadoop job
In my previous post I showed how to setup a complete Maven based project to create a Hadoop job in…
Read More » -
Enterprise Java
Run your Hadoop MapReduce job on Amazon EMR
I have posted a while ago how to setup an EMR cluster by using CLI. In this post I will…
Read More » -
Enterprise Java
Writing a Hadoop MapReduce task in Java
Although Hadoop Framework itself is created with Java the MapReduce jobs can be written in many different languages. In this…
Read More » -
Enterprise Java
Big Data Open Source Security
In security there has never (IMHO) been enough open source solutions and Bruce Schneier has written about this several times…
Read More »