Mar 14, 2020 in Spark Sql
Q: Why we have to use broadcast variables while working with Spark in Spark?

1 Answer

0 votes
Mar 14, 2020

The broadcast variables can be referred to as read-only variables present on every machine(in-memory cache). The broadcast variables usage can eliminate variable ship copy necessity, in this way the data can process at high speed. Storing the lookup table in the memory can be possible through broadcast variables. It is used to enhance the efficiency of retrieval compared to Resilient Distribution Datasets loops.

Click here to read more about Loan/Mortgage
Click here to read more about Insurance

Related questions

...