in Big Data | Hadoop by
Q:
What is the use of CLUSTERED BY clause during table creation in Hive?

1 Answer

0 votes
by

CLUSTERED BY in Hive is same as DISTRIBUTE BY and SORT

BY. When we specify CLUSTERED BY, it will first distribute the data into different reducers by using a Hash. Once data is distributed, it will sort the data.

 

We have to specify CLUSTERED BY clause during table creation. But it is useful in querying of data in Hive.

Click here to read more about Loan/Mortgage
Click here to read more about Insurance

Related questions

0 votes
asked Apr 3, 2020 in Big Data | Hadoop by Tate
0 votes
asked Apr 15, 2020 in Robotic Process Automation by SakshiSharma
...