What are Hadoop Archives? What is the command for archiving a group of files in HDFS?

1 Answer

Hadoop Archives (HAR) were introduced to cope with the growing memory usage of the NameNode, which has to hold metadata for every file and block and is therefore strained by large numbers of small files. An archive packs many small HDFS files into a single unit, which reduces the amount of metadata the NameNode has to keep. The archive name always carries the .har extension, and the archive can be treated as a layered file system on top of HDFS, with its contents accessed through har:// URIs. An archive is created with the hadoop archive command, which runs a MapReduce job to build it.
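
For the second part of the question, here is a minimal sketch of the command. The general form is hadoop archive -archiveName NAME.har -p PARENT SRC... DEST. The archive name (files.har), the paths and the helper har_command below are hypothetical placeholders chosen for illustration. The helper only prints the command so it can be checked before it is run on a cluster.

```shell
#!/bin/sh
# Sketch: compose the archive command for a group of HDFS files.
# The archive name and all paths are hypothetical placeholders.

# Usage: har_command ARCHIVE_NAME PARENT_DIR DEST_DIR SRC...
har_command() {
    name=$1; parent=$2; dest=$3
    shift 3
    # General form: hadoop archive -archiveName NAME -p PARENT SRC... DEST
    # SRC paths are interpreted relative to PARENT.
    echo "hadoop archive -archiveName $name -p $parent $* $dest"
}

# Print the command that would pack /user/hadoop/dir1 and /user/hadoop/dir2
# into /user/archives/files.har.
har_command files.har /user/hadoop /user/archives dir1 dir2

# Once the archive exists, it can be listed through the har:// scheme, e.g.:
#   hdfs dfs -ls har:///user/archives/files.har
```

Running the printed line on a cluster with MapReduce available should create the archive. Note that archiving does not delete the source files, so they have to be removed separately if the goal is to relieve the NameNode.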
