What do you understand by RDD Lineage?

Question

What do you understand by RDD Lineage?

1 Answer

rajeshsharma · Answer 1 · 2022-03-13T12:37:41+0000

The RDD lineage is a procedure that is used to reconstruct the lost data partitions. The Spark does not hold up data replication in the memory. If any data is lost, we have to rebuild it using RDD lineage. This is the best use case as RDD always remembers how to construct from other datasets.

What do you understand by RDD Lineage?

Please log in or register to answer this question.

1 Answer