in Big Data | Hadoop by
What is the optimization that can be done in SELECT * query in Hive?

1 Answer

0 votes

We can convert some of the SELECT queries in Hive into single FETCH  task. With this optimization, latency of SELECT query is decreased.

To use this we have to set the value of hive.fetch.task.conversion


parameter. The permissible values are:

0: It means FETCH is disabled.

1: It is minimal mode. SELECT *, FILTER on

partition columns (WHERE and HAVING clauses), LIMIT only

2: It is more mode: SELECT, FILTER,          LIMIT        only

(including virtual columns) "more" can even take UDF expressions in the SELECT clause.


Related questions

+1 vote
asked Apr 1, 2020 by AdilsonLima
0 votes
asked Jan 13, 2020 in Big Data | Hadoop by sharadyadav1986
0 votes
asked Apr 6, 2020 in Big Data | Hadoop by GeorgeBell