
Answer-first summary for fast verification
Answer: Partition data by a frequently queried column to improve query performance and storage efficiency.
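Option B is the optimal approach. Partitioning by a column that queries frequently filter on lets the engine read only the partitions matching the filter (partition pruning) instead of scanning the whole dataset, which improves query performance. Storage stays efficient as long as the key has low to moderate cardinality, because a very high-cardinality key splits the data into many small files.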
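To make the idea of targeted data access concrete, here is a minimal sketch that uses only the Python standard library. It writes rows into Hive-style `key=value` directories and then reads a single partition. The helper names `write_partitioned` and `read_partition`, the `region` column, and the sample rows are all hypothetical and chosen for illustration; a real lakehouse engine would handle this layout and the pruning itself.

```python
import csv
import tempfile
from collections import defaultdict
from pathlib import Path


def write_partitioned(rows, root, key):
    """Write rows into Hive-style directories: <root>/<key>=<value>/part-0.csv."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    for value, group in groups.items():
        part_dir = Path(root) / f"{key}={value}"
        part_dir.mkdir(parents=True, exist_ok=True)
        # The partition value is encoded in the directory name,
        # so the column is left out of the file itself.
        fields = [f for f in group[0] if f != key]
        with open(part_dir / "part-0.csv", "w", newline="") as fh:
            writer = csv.DictWriter(fh, fieldnames=fields, extrasaction="ignore")
            writer.writeheader()
            writer.writerows(group)


def read_partition(root, key, value):
    """Read only the directory for key=value; return (rows, files_opened)."""
    part_dir = Path(root) / f"{key}={value}"
    rows, files_opened = [], 0
    for path in sorted(part_dir.glob("*.csv")):
        files_opened += 1
        with open(path, newline="") as fh:
            for row in csv.DictReader(fh):
                row[key] = value  # restore the column from the directory name
                rows.append(row)
    return rows, files_opened


# Hypothetical sample data; "region" stands in for the frequently filtered column.
events = [
    {"region": "eu", "user": "a", "amount": "10"},
    {"region": "us", "user": "b", "amount": "20"},
    {"region": "eu", "user": "c", "amount": "30"},
    {"region": "apac", "user": "d", "amount": "40"},
]

with tempfile.TemporaryDirectory() as root:
    write_partitioned(events, root, "region")
    partitions = sorted(p.name for p in Path(root).iterdir())
    eu_rows, files_opened = read_partition(root, "region", "eu")
```

In this sketch a filter on `region == "eu"` opens one of the three partition directories and never touches the others. The same layout also shows the storage trade-off: every distinct key value becomes its own directory and file, so a key with many distinct values would produce many small files.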
Author: LeetQuiz Editorial Team
You need to implement file partitioning for a large dataset in your lakehouse to improve query performance. Describe the steps you would take to partition the data, including the selection of partition keys and the impact of this approach on storage and query efficiency.
A
Store all data in a single file; no partitioning is needed.
B
Partition data by a frequently queried column to improve query performance and storage efficiency.
C
Partition data randomly, with no specific strategy.
D
Partition data only by date, regardless of query patterns.