Spark core concepts explained

Apache Spark architecture is based on two main abstractions RDD and DAG, let’s dive in what those concepts are

Kirill Bobrov
7 min readFeb 5, 2021

--

Apache Spark is considered as a powerful complement to Hadoop, big data’s original technology. Spark is a more accessible, powerful and capable big data tool for tackling various big data challenges. It has become mainstream and…

--

--

Kirill Bobrov

helping robots conquer the earth and trying not to increase entropy using Python, Data Engineering, ML. Linkedin @luminousmen. Check out my blog—luminousmen.com