What is the lineage in Spark?

Question

What is the lineage in Spark?

1 Answer

sharadyadav1986 · Answer 1 · 2022-03-29T02:21:20+0000

In Apache Spark, when a transformation (map or filter etc.) is called, it is not executed by Spark immediately; instead, a lineage is created for each transformation. This lineage is used to keep track of what all transformations have to be applied on that RDD. It also traces the location from where it has to read the data.

What is the lineage in Spark?

Please log in or register to answer this question.

1 Answer