0 votes
in Spark Preliminaries by
Why are broadcast variables important when working with Apache Spark?

1 Answer

0 votes
by

Broadcast variables can only be read, and every machine has them in its memory cache. Using broadcast variables when working with Spark, you don't have to send copies of a variable for each task. This lets data be processed faster. Broadcast variables make it possible to store a lookup table in memory, which makes retrieval faster than with an RDD lookup ().

...