+1 vote
in Hive by

Suppose there are several small CSV files present in /user/input directory in HDFS and you want to create a single Hive table from these files. The data in these files have the following fields: {registration_no, name, email, address}. What will be your approach to solve this, and where will you create a single Hive table for multiple smaller files without degrading the performance of the system?

1 Answer

0 votes
by

Using SequenceFile format and grouping these small files together to form a single sequence file can solve this problem. Below are the steps:

Related questions

+1 vote
asked Jun 14, 2020 in Hive by SakshiSharma
0 votes
asked Jun 7, 2020 in Hive by Robindeniel
...