Which limitations of MapReduce Apache Spark can remove?

1 Answer


Apache Spark was developed to overcome the limitations of the MapReduce cluster-computing paradigm. Spark keeps intermediate data in memory, whereas MapReduce repeatedly writes it to and reads it back from disk between stages.

Here are a few areas where Apache Spark improves on MapReduce:

  • Apache Spark can cache data in memory, which benefits iterative algorithms and makes it well suited to machine learning workloads.
  • Apache Spark is easier to use because it offers higher-level APIs for operating on data. It supports SQL queries, streaming data, and graph processing.
  • Spark doesn't need Hadoop to run. It can run standalone and read from and write to other storage systems such as Cassandra or S3.
  • Apache Spark is fast: it can run programs up to 100 times faster in memory, or about 10 times faster on disk, than MapReduce.
