Apache Hadoop
Enterprise Java
MapReduce Algorithms – Understanding Data Joins Part 1
In this post we continue with our series of implementing the algorithms found in the Data-Intensive Text Processing with MapReduce…
Enterprise Java
Distributed System Development Considerations
There are a number of factors to take into account while developing distributed software systems. If you don’t even know…
Enterprise Java
Setting up Apache Hadoop Multi – Node Cluster
We are sharing our experience installing Apache Hadoop on Linux-based machines (multi-node). Here we will also share our…
Enterprise Java
Running Map-Reduce Job in Apache Hadoop (Multinode Cluster)
We describe here the process for running a MapReduce job on a multi-node Apache Hadoop cluster. To set up Apache…
Enterprise Java
Hadoop setup on single node and multi node
We will describe Hadoop setup on a single node and on multiple nodes. The Hadoop environment setup and configuration will be described…
Enterprise Java
How Hadoop Works? HDFS case study
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters…
Enterprise Java
Ganglia configuration for a small Hadoop cluster and some troubleshooting
Ganglia is an open-source, scalable and distributed monitoring system for large clusters. It collects, aggregates and provides time-series views of…
Enterprise Java
Hadoop Books Giveaway – Roundup
Fellow geeks, our giveaway of Packt Publishing’s books on Apache Hadoop has ended. You may find the original post for…
Enterprise Java
Spring meets Apache Hadoop
SpringSource has just announced the first GA release of Spring for Apache Hadoop. The goal of this project is to…