in Big Data | Hadoop by (2.6k points)

How can we delete duplicate rows from flat files?

1 Answer

0 votes
by (2.6k points)

We can delete duplicate rows from flat files by leveraging the sorter transformation and selecting the distinct option. Selecting this option will delete the duplicate rows.

Related questions

0 votes
asked Mar 10, 2020 in Big Data | Hadoop by Hodge (2.6k points)
...