in Hive by
Q:

Suppose there are several small CSV files present in /user/input directory in HDFS and you want to create a single Hive table from these files. The data in these files have the following fields: {registration_no, name, email, address}. What will be your approach to solve this, and where will you create a single Hive table for multiple smaller files without degrading the performance of the system?

► Click here to show 1 Answer

0 votes
by

Using SequenceFile format and grouping these small files together to form a single sequence file can solve this problem. Below are the steps:

Apache HDFS
Learn More with Madanswer

Related questions

0 votes
asked Jun 14, 2020 in Hive by Robindeniel
...