in Big Data | Hadoop by
How does inter cluster data copying works in Hadoop?

1 Answer

0 votes
by

In Hadoop, there is a utility called DistCP (Distributed Copy) to perform large inter/intra-cluster copying of data. This utility is also based on MapReduce. It creates Map tasks for files given as input.

After every copy using DistCP, it is recommended to run crosschecks to confirm that there is no data corruption

 

and copy is complete.

Related questions

0 votes
asked Jan 7, 2020 in Big Data | Hadoop by sharadyadav1986
0 votes
asked Nov 24, 2020 in HDFS by rahuljain1
0 votes
asked Sep 7, 2019 in Big Data | Hadoop by john ganales
...