Big Data
-
Software Development
Big Data 2013 Predictions
If you just invested a lot of money in a Big Data solution from any of the traditional BI vendors…
Read More » -
Enterprise Java
Hadoop Hangover: Launch a hadoop cluster CDH4 using Apache Whirr
This post is about how-to launch a CDH4 MRv1 or CDH4 Yarn cluster on EC2 instances. It’s said that you…
Read More » -
Enterprise Java
MapReduce Algorithms – Secondary Sorting
We continue with our series on implementing MapReduce algorithms found in Data-Intensive Text Processing with MapReduce book. Other posts in…
Read More » -
Enterprise Java
Calculating A Co-Occurrence Matrix with Hadoop
This post continues with our series of implementing the MapReduce algorithms found in the Data-Intensive Text Processing with MapReduce book.…
Read More » -
Software Development
How to be Big Data-native?
Big data has spawned a set of tools that deliver results beyond the buzz. It has started delivering real insights…
Read More » -
Enterprise Java
Hadoop + Amazon EC2 – An updated tutorial
There is an old tutorial placed at Hadoop’s wiki page: http://wiki.apache.org/hadoop/AmazonEC2, but recently I had to follow this tutorial and…
Read More » -
Enterprise Java
Lets Crunch big data
As developers our focus is on simpler, effective solutions and thus one of the most valued principle is “Keep it…
Read More » -
Enterprise Java
MapReduce: Working Through Data-Intensive Text Processing – Local Aggregation Part II
This post continues with the series on implementing algorithms found in the Data Intensive Processing with MapReduce book. Part one…
Read More » -
Enterprise Java
MapReduce: Working Through Data-Intensive Text Processing
It has been a while since I last posted, as I’ve been busy with some of the classes offered by…
Read More »