Big Data
-
Core Java
Apache Arrow on the JVM: Streaming Writes
Previously we went to create some schemas on Arrow. On this blog we will have a look on writing through…
Read More » -
Core Java
Apache Arrow on the JVM: Get Started and Schemas
Arrow is memory format for flat and hierarchical data. It is a popular format used by various big data tools,…
Read More » -
Software Development
Where is Apache Spark heading?
I watched (COVID19-era version of “attended”) the latest spark Summit and in one of the keynotes Reynold Xin from Databricks,…
Read More » -
Enterprise Java
Processing real-time data with Storm, Kafka and ElasticSearch – Part 1
This is an article of processing real-time data with Storm, Kafka and ElasticSearch. 1. Introduction How would you process a…
Read More » -
Enterprise Java
Popular frameworks for big data processing in Java
The big data challenge The concept of big data is understood differently in the variety of domains where companies face…
Read More » -
Software Development
Big data isn’t – well, almost
Back in ancient history (2004) Google’s Jeff Dean & Sanjay Ghemawat presented their innovative idea for dealing with huge data…
Read More » -
Software Development
A guide to the InfluxDBMapper and QueryBuilder for Java Part: 1
With the release of latest influxdb-java driver version came along the InfluxbMapper. To get started we need to spin up…
Read More » -
Agile
The Benefits of Adopting the Agile Approach in IT and Big Data Projects
IT and big data projects can be complex and are often ambiguous by nature. In the case of big data…
Read More » -
Software Development
Apache Kafka & KSQL & TensorFlow for Data Scientists via Python & Jupyter Notebook
Why would a data scientist use Kafka Jupyter Python KSQL TensorFlow all together in a single notebook? There is an…
Read More »