in Big Data | Hadoop by
Cluster By

hive> select id, name from person cluster by id;

When this query is executed, it will provide results to multiple reducers. If you have a set of columns and you are performing sort by and distribute by, you can replace it by cluster by. It sends records from a certain key column to the same reducer to cluster it.

...