You are tasked with optimizing a Delta Lake table for ad-hoc querying in a cost-sensitive environment. Which of the following actions would significantly improve query performance while preserving data integrity and compliance with data governance policies? Consider the need for scalability and the potential impact on operational costs. Choose the best option.
Explanation:
The correct answer is C. Columnar storage and predicate pushdown improve query performance by reducing the data scanned: the query engine reads only the columns a query references and skips data that cannot match its filters. Neither technique changes the table's transactional guarantees, so data integrity and compliance are maintained. Option A might improve performance, but at the risk of longer recovery times and less efficient data access. Option B is counterproductive: it increases storage costs and degrades performance by scanning unnecessary data. Option D is detrimental because it removes the ACID properties that are critical for data integrity in Delta Lake.
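To make the "minimizing the data scanned" point concrete, here is a minimal pure-Python sketch of how a filter can be used to skip whole files using per-file minimum and maximum statistics. It is a toy model and not Delta Lake's or Spark's API: the `FileStats` class, the `files_to_scan` function, the file names, and the row counts are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class FileStats:
    """Hypothetical per-file metadata: min/max of one column plus a row count."""
    path: str
    min_value: int
    max_value: int
    num_rows: int


def files_to_scan(files, lower, upper):
    """Keep only files whose [min, max] range overlaps lower <= value <= upper."""
    return [f for f in files if f.max_value >= lower and f.min_value <= upper]


# Four illustrative data files, each covering a disjoint range of the column.
files = [
    FileStats("part-0.parquet", 0, 99, 1000),
    FileStats("part-1.parquet", 100, 199, 1000),
    FileStats("part-2.parquet", 200, 299, 1000),
    FileStats("part-3.parquet", 300, 399, 1000),
]

# Filter equivalent to: WHERE value BETWEEN 150 AND 220
selected = files_to_scan(files, 150, 220)
rows_scanned = sum(f.num_rows for f in selected)
total_rows = sum(f.num_rows for f in files)

print([f.path for f in selected])
print(f"{rows_scanned} of {total_rows} rows scanned")
```

In this toy case only two of the four files overlap the filter range, so half of the rows are never read. The benefit depends on how well the data is clustered on the filtered column: if every file spanned the full value range, nothing could be skipped.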