Jan 13, 2020 in Big Data | Hadoop
Q: What are the operations that can cause a shuffle in Spark?

1 Answer

Jan 13, 2020

A shuffle redistributes data across partitions, usually so that all records with the same key end up in the same partition. Common operations that can trigger a shuffle in Spark include:

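1. repartition

2. coalesce (on an RDD, only when called with shuffle = true; by default it merges partitions without a shuffle)

3. groupByKey

4. reduceByKey

5. cogroup

6. join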
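To make the idea concrete, here is a rough sketch in plain Python (not the Spark API) of what a key-based shuffle does, and why reduceByKey typically moves fewer records than groupByKey. The helper names `partition_for`, `shuffle`, `group_by_key`, and `reduce_by_key` are hypothetical and exist only for this illustration; a real Spark shuffle also involves serialization, disk writes, and network transfer, none of which is modeled here.

```python
import zlib
from collections import defaultdict


def partition_for(key, num_partitions):
    # Hypothetical stand-in for a hash partitioner: same key, same partition.
    return zlib.crc32(str(key).encode("utf-8")) % num_partitions


def shuffle(input_partitions, num_partitions):
    """Route every (key, value) record to the partition chosen by its key.

    Returns the new partitions and the number of records that were moved.
    """
    output = [[] for _ in range(num_partitions)]
    moved = 0
    for part in input_partitions:
        for key, value in part:
            output[partition_for(key, num_partitions)].append((key, value))
            moved += 1
    return output, moved


def group_by_key(input_partitions, num_partitions):
    # Every record is shuffled first, then grouped on the receiving side.
    shuffled, moved = shuffle(input_partitions, num_partitions)
    result = []
    for part in shuffled:
        groups = defaultdict(list)
        for key, value in part:
            groups[key].append(value)
        result.append(dict(groups))
    return result, moved


def reduce_by_key(input_partitions, num_partitions, func):
    # Combine values per key inside each input partition before shuffling,
    # so at most one record per key per partition has to move.
    combined = []
    for part in input_partitions:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        combined.append(list(local.items()))
    shuffled, moved = shuffle(combined, num_partitions)
    # Merge the partial results that arrived from different input partitions.
    result = []
    for part in shuffled:
        merged = {}
        for key, value in part:
            merged[key] = func(merged[key], value) if key in merged else value
        result.append(merged)
    return result, moved


# Two input partitions of (word, 1) pairs, made up for this example.
data = [
    [("a", 1), ("b", 1), ("a", 1)],
    [("b", 1), ("a", 1), ("c", 1)],
]

grouped, grouped_moved = group_by_key(data, 2)
reduced, reduced_moved = reduce_by_key(data, 2, lambda x, y: x + y)

print("records shuffled by group_by_key: ", grouped_moved)
print("records shuffled by reduce_by_key:", reduced_moved)
```

In this toy model, group_by_key moves all six input records, while reduce_by_key moves five because the two ("a", 1) records in the first partition are combined before the shuffle. The gap grows as keys repeat more often within a partition.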

