0 votes
in Big Data | Hadoop by
What are the operations that can cause a shuffle in Spark?

1 Answer

0 votes
by

Some of the common operations that can cause a shuffle internally in Spark are as follows:

1. Repartition

2. Coalesce

3. GroupByKey

4. ReduceByKey

5. Cogroup

6. Join

...