Categories

Jan 12 in Big Data | Hadoop
Q: What is the use of ORC format tables in Hive?

1 Answer

Jan 12

We use Optimized Row Columnar (ORC) file format to store data efficiently in Hive. It is used for performance improvement in reading, writing and processing of data.

In ORC format, we can overcome the limitations of other Hive file formats. Some of the advantages of ORC format are:

 

There is single file as the output of each task. This reduces load on NameNode.

It supports date time, decimal, struct, map etc complex types.

It stores light-weight indexes within the file.

We can bound the memory used in read/write of data.

It stores metadata with Protocol Buffers that supports add/remove of fields.

 

Click here to read more about Loan/Mortgage
Click here to read more about Insurance

Related questions

Madanswer
Apr 3 in Big Data | Hadoop
Jan 12 in Big Data | Hadoop
Jun 7 in Hive
...