Apache Spark
-
Software Development
Monitoring Real-Time Uber Data Using Spark Machine Learning, Streaming, and the Kafka API (Part 2)
This post is the second part in a series where we will build a real-time example for analysis and monitoring…
Read More » -
Software Development
Monitoring Real-Time Uber Data Using Spark Machine Learning, Streaming, and the Kafka API (Part 1)
According to Gartner, by 2020, a quarter of a billion connected cars will form a major element of the Internet…
Read More » -
Software Development
Apache Spark: A Quick Start With Python
Spark Overview As per the official website, “Apache Spark is a fast and general engine for large scale data processing”…
Read More » -
Software Development
Predicting Breast Cancer Using Apache Spark Machine Learning Logistic Regression
In this blog post, I’ll help you get started using Apache Spark’s spark.ml Logistic Regression for predicting cancer malignancy. Spark’s…
Read More » -
Software Development
How to Get Started with Spark Streaming and MapR Streams Using the Kafka API
This post will help you get started using Apache Spark Streaming for consuming and publishing messages with MapR Streams and the Kafka…
Read More » -
Enterprise Java
Apache Spark Packages, from XML to JSON
The Apache Spark community has put a lot of effort into extending Spark. Recently, we wanted to transform an XML…
Read More » -
Software Development
Testing Spark Streaming: Unit testing
There is enough evidence to prove the importance of automated testing. Projects in new fields often neglect automated testing, as…
Read More » -
Software Development
Persistent Storage for Enterprise-Grade Spark Applications
Apache Spark is becoming very popular and widely used in the big data community. There are several reasons for Spark…
Read More » -
Software Development
A Functional Approach to Logging in Apache Spark
Logging in Apache Spark is very easy to do, since Spark offers access to a logobject out of the box;…
Read More »