0 votes
in Hadoop by
What is HDFS- Hadoop Distributed File System?

1 Answer

0 votes
by

What is HDFS- Hadoop Distributed File System?

Hadoop distributed file system (HDFS) is the primary storage system of Hadoop. HDFS stores very large files running on a cluster of commodity hardware. It works on the principle of storage of less number of large files rather than the huge number of small files. HDFS stores data reliably even in the case of hardware failure. It provides high throughput access to the application by accessing in parallel.

Components of HDFS:

NameNode – It is also known as Master node. Namenode stores meta-data i.e. number of blocks, their replicas and other details.

DataNode – It is also known as Slave. In Hadoop HDFS, DataNode is responsible for storing actual data. DataNode performs read and write operation as per request for the clients in HDFS.

Read Hadoop HDFS in detail

Related questions

0 votes
asked Jun 26, 2023 in Hadoop by Robindeniel
0 votes
asked Jun 7, 2020 in Hadoop by Robindeniel
...