in Big Data | Hadoop by
Q:
What are the operations that can cause a shuffle in Spark?

1 Answer

0 votes
by

Some of the common operations that can cause a shuffle internally in Spark are as follows:

1. Repartition

2. Coalesce

3. GroupByKey

4. ReduceByKey

5. Cogroup

6. Join

Related questions

0 votes
asked Mar 14, 2020 in Spark Sql by rajeshsharma
0 votes
asked Sep 23, 2019 in Salesforce by john ganales
...