PySpark
Python
PySpark – Create Empty Dataframe and RDD
DataFrames and RDDs (Resilient Distributed Datasets) are fundamental abstractions in Apache Spark, a powerful distributed computing framework. Let us delve…
Software Development
Where is Apache Spark heading?
I watched (the COVID-19-era version of “attended”) the latest Spark Summit, and in one of the keynotes, Reynold Xin from Databricks,…