Why does HDFS store data on commodity hardware despite the higher chance of failure?

1 Answer


Commodity hardware is inexpensive and easy to scale out, and HDFS can run on it safely because HDFS is designed to be highly fault-tolerant. It achieves fault tolerance by splitting files into blocks, replicating each block, and distributing the replicas across different DataNodes in the cluster. The default replication factor is 3, and it is configurable. Replication protects against data loss when a node crashes, a disk or other hardware fails, and so on. When any machine in the cluster goes down, the client can still read its data from another machine that holds a copy of the same blocks.
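
To make the replication argument concrete, here is a minimal sketch in Python. It is a toy simulation, not HDFS code: the function names `place_blocks` and `readable_blocks`, the node names, and the block IDs are all hypothetical, and replicas are placed at random rather than with HDFS's actual rack-aware placement policy. It only illustrates why a replication factor of 3 lets every block survive the loss of any two DataNodes.

```python
import random


def place_blocks(block_ids, datanodes, replication=3, seed=0):
    """Assign each block to `replication` distinct DataNodes.

    Assumption: uniform random placement, used only for illustration.
    Real HDFS uses a rack-aware placement policy instead.
    """
    if replication > len(datanodes):
        raise ValueError("replication factor exceeds number of DataNodes")
    rng = random.Random(seed)
    return {blk: set(rng.sample(datanodes, replication)) for blk in block_ids}


def readable_blocks(placement, failed_nodes):
    """Return the blocks that still have at least one replica on a live node."""
    failed = set(failed_nodes)
    return {blk for blk, nodes in placement.items() if nodes - failed}


# Hypothetical five-node cluster holding four blocks, three replicas each.
datanodes = [f"dn{i}" for i in range(1, 6)]
placement = place_blocks(["blk_1", "blk_2", "blk_3", "blk_4"], datanodes)

# Take two machines down and check which blocks a client could still read.
survivors = readable_blocks(placement, failed_nodes=["dn1", "dn2"])
print(sorted(survivors))
```

Because every block lives on three different nodes, taking down any two nodes always leaves at least one replica. A block becomes unreadable only if all of the nodes holding its replicas fail before the cluster has re-replicated it.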

