What are the operations that can cause a shuffle in Spark?

1 Answer


Some of the common operations that can cause a shuffle internally in Spark are as follows:

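1. repartition: always performs a full shuffle to redistribute the data across the requested number of partitions.

2. coalesce: on an RDD this shuffles only when called with shuffle = true. By default it merges existing partitions without a shuffle, which is why it is usually preferred for reducing the partition count.

3. groupByKey: moves all values for a given key to the same partition.

4. reduceByKey: also shuffles by key, but combines values within each partition first, so it usually moves less data than groupByKey.

5. cogroup: groups the values for each key from two or more RDDs, which normally requires shuffling them by key.

6. join: normally shuffles both sides by key. The shuffle can be avoided when both RDDs are already partitioned with the same partitioner.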
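
To see why the by-key operations in this list need a shuffle, here is a minimal pure-Python sketch of the idea. It is a simplified model, not Spark's implementation: the helper names (`partition_for`, `shuffle_by_key`, `reduce_by_key`) and the CRC32-based partitioner are assumptions made for illustration only. Records with the same key start out scattered across partitions, and the shuffle step moves them so that each key ends up in exactly one partition.

```python
import zlib


def partition_for(key, num_partitions):
    # Hypothetical deterministic partitioner (stand-in for a hash partitioner;
    # this is NOT the hash function Spark uses).
    return zlib.crc32(str(key).encode("utf-8")) % num_partitions


def shuffle_by_key(partitions, num_partitions):
    # Redistribute (key, value) records so equal keys land in the same partition.
    shuffled = [[] for _ in range(num_partitions)]
    for part in partitions:
        for key, value in part:
            shuffled[partition_for(key, num_partitions)].append((key, value))
    return shuffled


def combine(records, func):
    # Merge values that share a key within a single partition.
    merged = {}
    for key, value in records:
        merged[key] = func(merged[key], value) if key in merged else value
    return merged


def reduce_by_key(partitions, func, num_partitions):
    # Step 1: combine locally, before any data moves (less data to shuffle).
    locally_combined = [list(combine(part, func).items()) for part in partitions]
    # Step 2: shuffle the partial results by key.
    shuffled = shuffle_by_key(locally_combined, num_partitions)
    # Step 3: combine again, now that each key lives in one partition.
    return [combine(part, func) for part in shuffled]


# Keys "a", "b" and "c" are spread over three input partitions.
input_partitions = [
    [("a", 1), ("b", 2), ("a", 3)],
    [("b", 4), ("c", 5)],
    [("a", 6), ("c", 7)],
]

reduced = reduce_by_key(input_partitions, lambda x, y: x + y, num_partitions=2)
totals = {key: value for part in reduced for key, value in part.items()}
print(totals)
```

In this model, skipping step 1 corresponds roughly to groupByKey, where every record crosses the shuffle instead of one partial result per key per partition.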
