in Big Data | Hadoop by
What is the difference between Apache Spark and Apache Hadoop MapReduce?

1 Answer

0 votes

Some of the main differences between Apache Spark and Hadoop MapReduce are follows:

1. Speed: Apache Spark is 10X to 100X faster than Hadoop due to its usage of in memory processing.


2. Memory: Apache Spark stores data in memory, whereas Hadoop MapReduce stores data in hard disk.

3. RDD: Spark uses Resilient Distributed Dataset (RDD) that guarantee fault tolerance. Where Apache Hadoop uses replication of data in multiple copies to achieve fault tolerance.

4. Streaming: Apache Spark supports Streaming with very less administration. This


makes it much easier to use than Hadoop for real-time stream processing.

5. API: Spark provides a versatile API that can be used with multiple data sources as well as languages. It is more extensible than the API provided by Apache Hadoop.


Related questions

+1 vote
asked Feb 23, 2020 in Big Data | Hadoop by rahuljain1
0 votes
asked Jan 26, 2020 in Big Data | Hadoop by rajeshsharma
0 votes
asked Jan 13, 2020 in Big Data | Hadoop by sharadyadav1986
0 votes
asked Feb 22, 2020 in Big Data | Hadoop by SakshiSharma