Jan 13, 2020 in Big Data | Hadoop
Q: What is the difference between Apache Spark and Apache Hadoop MapReduce?

1 Answer

0 votes
Jan 13, 2020

Some of the main differences between Apache Spark and Hadoop MapReduce are follows:

1. Speed: Apache Spark is 10X to 100X faster than Hadoop due to its usage of in memory processing.

 

2. Memory: Apache Spark stores data in memory, whereas Hadoop MapReduce stores data in hard disk.

3. RDD: Spark uses Resilient Distributed Dataset (RDD) that guarantee fault tolerance. Where Apache Hadoop uses replication of data in multiple copies to achieve fault tolerance.

4. Streaming: Apache Spark supports Streaming with very less administration. This

 

makes it much easier to use than Hadoop for real-time stream processing.

5. API: Spark provides a versatile API that can be used with multiple data sources as well as languages. It is more extensible than the API provided by Apache Hadoop.

 

Related questions

0 votes
Jan 11, 2020 in Big Data | Hadoop
0 votes
Jan 11, 2020 in Big Data | Hadoop
...