in Big Data | Hadoop by (31.7k points)
What are the operations that can cause a shuffle in Spark?

1 Answer

0 votes
by (32.2k points)

Some of the common operations that can cause a shuffle internally in Spark are as follows:

1. Repartition

2. Coalesce

3. GroupByKey

4. ReduceByKey

5. Cogroup

6. Join

Related questions

0 votes
asked Jan 13, 2020 in Big Data | Hadoop by sharadyadav1986 (31.7k points)
0 votes
asked Jan 13, 2020 in Big Data | Hadoop by sharadyadav1986 (31.7k points)
+1 vote
asked Feb 23, 2020 in Big Data | Hadoop by rahuljain1 (6.5k points)
...