0 votes
in Apache Spark by

How is Apache Spark different from MapReduce?

1 Answer

0 votes
by

Apache Spark

MapReduce

Spark processes data in batches as well as in real-time

MapReduce processes data in batches only

Spark runs almost 100 times faster than Hadoop MapReduce

Hadoop MapReduce is slower when it comes to large scale data processing

Spark stores data in the RAM i.e. in-memory. So, it is easier to retrieve it

Hadoop MapReduce data is stored in HDFS and hence takes a long time to retrieve the data

Spark provides caching and in-memory data storage

Hadoop is highly disk-dependent

Related questions

0 votes
asked Feb 15, 2023 in Spark Preliminaries by Robindeniel
0 votes
asked Jun 20, 2023 in HDFS by john ganales
...