
Jan 12 in Big Data | Hadoop

Q: What is the use of the CLUSTERED BY clause during table creation in Hive?

1 Answer

Jan 12

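CLUSTERED BY in a Hive CREATE TABLE statement declares the table as bucketed. It is related to, but not the same as, the query clause CLUSTER BY, which is shorthand for DISTRIBUTE BY and SORT BY on the same columns. When data is written to a table defined with CLUSTERED BY, Hive hashes the clustering columns and uses the hash to distribute the rows across reducers, one per bucket. The rows inside each bucket are sorted only if the table definition also includes a SORTED BY clause.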
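To make the DISTRIBUTE BY and SORT BY relationship concrete, here is a small sketch of the query-level CLUSTER BY clause. The table and column names (user_events, user_id, event_type) are made up for illustration and do not come from the question.

```sql
-- Hypothetical table and columns, used only for illustration.

-- CLUSTER BY sends rows with the same user_id to the same reducer
-- and sorts the rows within each reducer by user_id.
SELECT user_id, event_type
FROM user_events
CLUSTER BY user_id;

-- The same request written out in long form.
SELECT user_id, event_type
FROM user_events
DISTRIBUTE BY user_id
SORT BY user_id;
```

Note that the sort is per reducer, so the combined output is not globally ordered the way it would be with ORDER BY.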

 

The CLUSTERED BY clause has to be specified when the table is created, but its benefit shows up at query time: Hive can use the bucket layout for efficient sampling and for joins on the bucketed columns.
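As a rough sketch of what this looks like in practice, the statements below create a bucketed table and then read a single bucket from it. The table name, column names, bucket count, and storage format are all assumptions chosen for the example.

```sql
-- Hypothetical bucketed table: rows are hashed on user_id into 32 buckets.
-- SORTED BY is optional and keeps rows ordered within each bucket.
CREATE TABLE user_events (
  user_id    BIGINT,
  event_type STRING,
  event_time TIMESTAMP
)
CLUSTERED BY (user_id) SORTED BY (event_time) INTO 32 BUCKETS
STORED AS ORC;

-- At query time, sample one of the 32 buckets instead of scanning the table.
SELECT *
FROM user_events TABLESAMPLE (BUCKET 1 OUT OF 32 ON user_id);
```

The bucket count is fixed at creation time, so it is usually chosen with the expected data volume and the join partners in mind.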
