
Answer-first summary for fast verification
Answer: Upgrade the external table to a BigLake table. Enable metadata caching for the table.
The correct answer is C. Upgrading the external table to a BigLake table and enabling metadata caching is the most effective way to improve query performance for Apache Hive partitioned data in a Cloud Storage bucket. BigLake tables handle large datasets more efficiently than standard external tables, and metadata caching reduces the time taken to read and process metadata from a large number of files, thus significantly improving query performance.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You have set up an external table in Apache Hive for partitioned data stored in a Cloud Storage bucket, which contains a significant volume of files. However, you observe that the queries executed on this table are performing slowly. In order to enhance the query performance, what actions should you take?
A
Change the storage class of the Hive partitioned data objects from Coldline to Standard.
B
Create an individual external table for each Hive partition by using a common table name prefix. Use wildcard table queries to reference the partitioned data.
C
Upgrade the external table to a BigLake table. Enable metadata caching for the table.
D
Migrate the Hive partitioned data objects to a multi-region Cloud Storage bucket.
No comments yet.