0 votes
in Big Data | Hadoop by

Why is Checkpointing Important in Hadoop?

1 Answer

0 votes
by

As more and more files are added the namenode creates large edit logs. Which can substantially delay NameNode startup as the NameNode reapplies all the edits. Checkpointing is a process that takes an fsimage and edit log and compacts them into a new fsimage. This way, instead of replaying a potentially unbounded edit log, the NameNode can load the final in-memory state directly from the fsimage. This is a far more efficient operation and reduces NameNode startup time.

Related questions

0 votes
asked Jan 11, 2020 in Big Data | Hadoop by rajeshsharma
+1 vote
asked Jan 29, 2022 in Big Data | Hadoop by sharadyadav1986
...