Jan 12, 2020 in Big Data | Hadoop
Q: What optimization can be applied to a SELECT * query in Hive?

1 Answer


Hive can convert some SELECT queries into a single FETCH task. A FETCH task reads the data directly instead of launching a distributed job, so the latency of the SELECT query decreases.

To use this, set the hive.fetch.task.conversion parameter. The permissible values are:

0. none: FETCH conversion is disabled.

1. minimal: SELECT *, FILTER on partition columns (WHERE and HAVING clauses), and LIMIT only.

2. more: SELECT, FILTER, and LIMIT only (including virtual columns). "more" can also take UDF expressions in the SELECT clause.
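As a rough illustration, the setting can be changed per session and the effect checked with EXPLAIN. This is only a sketch: the table `sales`, its partition column `dt`, and the columns `order_id`, `customer`, and `amount` are hypothetical names, and whether a given query is actually converted can also depend on the Hive version and other settings.

```sql
-- Hypothetical table: sales, partitioned by dt (all names are illustrative only).

-- Enable the broadest fetch-task conversion for this session.
SET hive.fetch.task.conversion=more;

-- A plain projection with a UDF, a filter, and a LIMIT: a candidate for a single FETCH task.
SELECT order_id, upper(customer)
FROM sales
WHERE amount > 100
LIMIT 10;

-- Inspect the plan to see whether the query was planned as a fetch-only stage.
EXPLAIN
SELECT * FROM sales WHERE dt = '2020-01-12' LIMIT 10;
```

Setting the value to none in the same way turns the conversion off, which can be useful for comparing latency with and without it.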

