Why do we perform partitioning in Hive?

Question

Why do we perform partitioning in Hive?

1 Answer

SakshiSharma · Answer 1 · 2020-01-10T07:33:59+0000

Partitioning provides granularity in a Hive table and therefore, reduces the query latency by scanning only relevant partitioned data instead of the whole data set.

For example, we can partition a transaction log of an e – commerce website based on month like Jan, February, etc. So, any analytics regarding a particular month, say Jan, will have to scan the Jan partition (sub – directory) only instead of the whole table data.

Why do we perform partitioning in Hive?

Please log in or register to answer this question.

1 Answer