
Answer-first summary for fast verification
Answer: Implement **Liquid Clustering** on the columns `(event_date, event_type, country_code)` and disable all manual partitioning and Z-Ordering, relying on automatic data layout management and file compaction.
**A: Incorrect (suboptimal for this scenario)** Increasing the target file size to ~1 GB via compaction reduces small-file problems and metadata overhead, and is sound table hygiene. However, without clustering (Z-Order or Liquid), file skipping remains limited to partition pruning (if partitioned) or basic min/max statistics. Ad-hoc filters on combinations of medium- and high-cardinality columns (especially user_id ranges) will still scan large amounts of data, so the gains fall well short of the "significant" improvement the workload requires.

**B: Incorrect (high risk of over-partitioning and maintenance burden)** Partitioning by `event_date` enables partition pruning for time filters, and Z-Ordering on the three columns can co-locate data for better skipping on mixed predicates. However, ~3 years of daily data means roughly 1,100 partitions; streaming writes into that many partitions produce many small files, defeating the purpose unless aggressive auto-compaction plus frequent OPTIMIZE runs are scheduled. Z-Ordering on three columns including the high-cardinality user_id also dilutes effectiveness (Z-Order works best on 1–4 columns of low-to-medium cardinality). The maintenance cost and risk of skewed, small partitions make this less attractive on modern runtimes.

**C: Correct (best modern approach for this scenario)** **Liquid Clustering** (introduced in DBR 13.3 LTS, generally available in later runtimes, and recommended for new Delta tables) automatically clusters data on up to four clustering keys (`event_date, event_type, country_code`) using a multi-dimensional layout that improves on the classic Z-Order curve. It provides strong data skipping for ad-hoc combinations of these columns without manual partitioning, avoiding over-partitioning pitfalls.

It compacts files automatically in the background, reduces the maintenance burden (no frequent manual OPTIMIZE needed for layout), preserves all Delta features (ACID, time travel, schema evolution, Unity Catalog security), and scales well to TB-scale tables with mixed query patterns. In a cost-sensitive environment running ad-hoc workloads on serverless SQL warehouses, this delivers the best balance of performance uplift and low operational cost.

**D: Incorrect (incomplete, with risky trade-offs)** Auto-Optimize and Optimized Writes are excellent hygiene and should be enabled, but partitioning only by the low-cardinality `country_code` misses pruning for the dominant `event_date` filter. Bloom filter indexes help equality lookups on high-cardinality columns (e.g., `user_id = ...`), but not the range or multi-column filters common in ad-hoc analysis. Avoiding `OPTIMIZE` entirely risks persistent small-file buildup from streaming ingestion, degrading performance over time. This is partial optimization at best.
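A minimal DDL sketch of the recommended approach, using the scenario's column names (the table name `clickstream_events` is illustrative, and the `ALTER TABLE ... CLUSTER BY` path for existing tables depends on runtime version support):

```sql
-- New table: declare clustering keys instead of PARTITIONED BY / ZORDER BY
CREATE TABLE clickstream_events (
  user_id      BIGINT,
  session_id   STRING,
  event_type   STRING,
  country_code STRING,
  event_date   DATE
)
CLUSTER BY (event_date, event_type, country_code);

-- Existing (unpartitioned) table: switch it to Liquid Clustering
ALTER TABLE clickstream_events
  CLUSTER BY (event_date, event_type, country_code);

-- Incremental clustering and compaction; note there is no ZORDER BY clause
OPTIMIZE clickstream_events;
```

Note that `CLUSTER BY` replaces both `PARTITIONED BY` and `ZORDER BY`; the two are mutually exclusive with Liquid Clustering.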
Author: LeetQuiz Editorial Team
You are optimizing a large Delta Lake table (~8 TB) that supports frequent ad-hoc exploratory queries from analysts in a cost-sensitive environment. The table contains clickstream data with high-cardinality filtering columns (user_id, session_id), medium-cardinality columns (event_type, country_code), and a commonly used time-based column (event_date — ~3 years of daily data). Queries often filter on combinations of event_date + event_type or event_date + country_code, with occasional filters on user_id ranges or session patterns, but almost never equality filters on high-cardinality IDs alone.
The table currently has thousands of small files per partition due to streaming ingestion, and analysts frequently complain about slow response times on Databricks SQL warehouses (serverless, Photon-enabled). You must significantly improve query performance and reduce DBU consumption while preserving full ACID compliance, time travel, schema evolution, and governance controls (including row-level security via Unity Catalog).
Which of the following approaches represents the most effective single action (or primary strategy) under these constraints?
A
Run OPTIMIZE regularly without Z-Ordering, targeting file sizes of ~1 GB via spark.databricks.delta.optimize.maxFileSize, accepting longer OPTIMIZE runtimes to minimize small-file problems and reduce metadata overhead.
B
Partition the table by event_date (using PARTITIONED BY (event_date)) and implement Z-Ordering on (event_type, country_code, user_id) in the same OPTIMIZE ... ZORDER BY command, then schedule frequent OPTIMIZE jobs.
C
Implement Liquid Clustering on the columns (event_date, event_type, country_code) and disable all manual partitioning and Z-Ordering, relying on automatic data layout management and file compaction.
D
Enable Auto-Optimize + Optimized Writes, partition by country_code (low cardinality), add bloom filter indexes on user_id and session_id, and avoid running OPTIMIZE to minimize compute costs during maintenance.
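For concreteness, the four options correspond roughly to the following commands (a sketch only: the table name `clickstream_events` is illustrative, and option B's `PARTITIONED BY` clause is fixed at table creation so only its OPTIMIZE step is shown):

```sql
-- A: compaction only, with a ~1 GB target file size
SET spark.databricks.delta.optimize.maxFileSize = 1073741824;
OPTIMIZE clickstream_events;

-- B: date-partitioned table plus multi-column Z-Ordering
OPTIMIZE clickstream_events ZORDER BY (event_type, country_code, user_id);

-- C: Liquid Clustering on the three common filter columns
ALTER TABLE clickstream_events
  CLUSTER BY (event_date, event_type, country_code);

-- D: write-time tuning plus bloom filter indexes, no OPTIMIZE
ALTER TABLE clickstream_events SET TBLPROPERTIES (
  'delta.autoOptimize.optimizeWrite' = 'true',
  'delta.autoOptimize.autoCompact'   = 'true'
);
CREATE BLOOMFILTER INDEX ON TABLE clickstream_events
  FOR COLUMNS (user_id, session_id);
```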