0 votes
in Spark Preliminaries by
How does Spark divide up data?

1 Answer

0 votes
by

The map-reduce API is used for the data partition in Spark. In the input format, one can make more than one partition. For best performance, the HDFS block size is the partition size, but you can change partition sizes with tools like Split.

Related questions

+1 vote
asked Sep 16, 2022 in Spark Preliminaries by sharadyadav1986
0 votes
asked Feb 16, 2023 in Spark Preliminaries by Robindeniel
...