0 votes
in Big Data | Hadoop by
What is the use of CLUSTERED BY clause during table creation in Hive?

1 Answer

0 votes
by

CLUSTERED BY in Hive is same as DISTRIBUTE BY and SORT

BY. When we specify CLUSTERED BY, it will first distribute the data into different reducers by using a Hash. Once data is distributed, it will sort the data.

 

We have to specify CLUSTERED BY clause during table creation. But it is useful in querying of data in Hive.

Related questions

+1 vote
asked Jan 12, 2020 in Big Data | Hadoop by sharadyadav1986
0 votes
asked Apr 1, 2020 in Big Data | Hadoop by AdilsonLima
...