CLUSTERED BY in Hive is same as DISTRIBUTE BY and SORT
BY. When we specify CLUSTERED BY, it will first distribute the data into different reducers by using a Hash. Once data is distributed, it will sort the data.
We have to specify CLUSTERED BY clause during table creation. But it is useful in querying of data in Hive.