How does Spark divide up data?

Question

How does Spark divide up data?

1 Answer

Robindeniel · Answer 1 · 2023-02-16T13:37:21+0000

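Spark relies on the Hadoop MapReduce input-format API to partition the data it reads from files. An input format can split a single file into more than one partition. By default each HDFS block becomes one partition, which usually gives good performance, but you can change the partition size by adjusting the input split size or by asking for a different number of partitions.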
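To make the block-size rule concrete, here is a rough sketch in plain Python of how a Hadoop-style file input format might decide how many splits (and so how many Spark partitions) one file gets. The function name `estimate_partitions` and its parameters are made up for illustration, and the 128 MiB block size is an assumed default; the real logic lives in Hadoop and handles more cases (multiple files, compression, non-splittable formats).

```python
MIB = 1024 * 1024

def estimate_partitions(file_size, block_size=128 * MIB, min_partitions=1, min_split_size=1):
    """Approximate the number of input splits for a single splittable file.

    Hypothetical helper: a simplified model of Hadoop-style split sizing,
    not a call into Spark or Hadoop.
    """
    # Target size if the file were cut into the requested number of partitions.
    goal_size = file_size // max(min_partitions, 1)
    # A split is never larger than a block unless min_split_size forces it.
    split_size = max(1, min_split_size, min(goal_size, block_size))
    slop = 1.1  # a final chunk up to 10% over split_size is not cut again
    count = 0
    remaining = file_size
    while remaining / split_size > slop:
        remaining -= split_size
        count += 1
    if remaining > 0:
        count += 1
    return count

print(estimate_partitions(1024 * MIB))                     # 8: one per 128 MiB block
print(estimate_partitions(1024 * MIB, min_partitions=16))  # 16: smaller 64 MiB splits
print(estimate_partitions(130 * MIB))                      # 1: within the 10% slop
```

In Spark itself you would normally not compute this by hand. For example, in PySpark `sc.textFile(path, minPartitions=16)` requests more partitions when reading, `rdd.getNumPartitions()` reports what you got, and `rdd.repartition(n)` changes the count afterwards.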
