Jan 11 in Big Data | Hadoop
Q: How does inter-cluster data copying work in Hadoop?

1 Answer

Jan 11

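Hadoop provides a utility called DistCp (distributed copy) for large inter-cluster and intra-cluster copies. DistCp runs as a MapReduce job: it expands the input paths into a list of files and directories and divides that list among map tasks, each of which copies its share of the files from the source to the destination.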
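As a rough illustration, here is a minimal Python sketch that assembles a `hadoop distcp` command line. The NameNode addresses (`nn1`, `nn2`), port, and paths are made-up placeholders, and `build_distcp_command` is a hypothetical helper, not part of Hadoop. It uses only the `-update` and `-m` options of the DistCp command line.

```python
import shlex


def build_distcp_command(src, dst, max_maps=None, update=False):
    """Return the argv list for a `hadoop distcp` run (hypothetical helper)."""
    cmd = ["hadoop", "distcp"]
    if update:
        # -update copies only files that are missing or differ at the target.
        cmd.append("-update")
    if max_maps is not None:
        # -m caps the number of simultaneous map tasks used for the copy.
        cmd += ["-m", str(max_maps)]
    cmd += [src, dst]
    return cmd


if __name__ == "__main__":
    # Placeholder cluster addresses and paths; substitute your own.
    cmd = build_distcp_command(
        "hdfs://nn1:8020/data/logs",
        "hdfs://nn2:8020/data/logs",
        max_maps=20,
        update=True,
    )
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a machine where the `hadoop` client is installed and configured for both clusters.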

After every copy using DistCp, it is recommended to run cross-checks to confirm that there is no data corruption and that the copy is complete.
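One possible cross-check, sketched in Python under some assumptions: capture the output of `hdfs dfs -ls -R` on both clusters, then compare relative paths and file sizes. The parser assumes the usual eight-column listing (permissions, replication, owner, group, size, date, time, path). Both function names are hypothetical. Matching sizes show only that the copy looks complete; to check for corruption you would also compare checksums, for example with `hdfs dfs -checksum`.

```python
def parse_ls_recursive(text, root):
    """Map relative file path -> size in bytes from `hdfs dfs -ls -R` output.

    Assumes eight whitespace-separated columns per entry; directories and
    header lines such as "Found N items" are skipped.
    """
    manifest = {}
    prefix = root.rstrip("/") + "/"
    for line in text.splitlines():
        fields = line.split(None, 7)
        if len(fields) < 8 or fields[0].startswith("d"):
            continue  # not a file entry
        path = fields[7]
        if path.startswith(prefix):
            manifest[path[len(prefix):]] = int(fields[4])
    return manifest


def compare_manifests(source, dest):
    """Return (missing, mismatched) relative paths, each sorted."""
    missing = sorted(p for p in source if p not in dest)
    mismatched = sorted(
        p for p in source if p in dest and source[p] != dest[p]
    )
    return missing, mismatched
```

An empty `missing` list and an empty `mismatched` list mean every source file exists at the destination with the same size.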

