Google Professional Data Engineer

Google Professional Data Engineer

Get started today

Ultimate access to all questions.


You have set up an external table in Apache Hive for partitioned data stored in a Cloud Storage bucket, which contains a significant volume of files. However, you observe that the queries executed on this table are performing slowly. In order to enhance the query performance, what actions should you take?




Explanation:

The correct answer is C. Upgrading the external table to a BigLake table and enabling metadata caching is the most effective way to improve query performance for Apache Hive partitioned data in a Cloud Storage bucket. BigLake tables handle large datasets more efficiently than standard external tables, and metadata caching reduces the time taken to read and process metadata from a large number of files, thus significantly improving query performance.