
Answer-first summary for fast verification
Answer: Partition data by a frequently queried column, improves query performance and manageability.
Option B is the optimal approach. Partitioning data by a frequently queried column can significantly improve query performance and manageability, as it allows for more targeted data access and reduces the amount of data that needs to be scanned.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are working with a large transactional dataset that needs to be partitioned for efficient querying in your lakehouse. Describe the process of implementing file partitioning and discuss the benefits of using this approach over storing data in a single file.
A
Store all data in a single large file, no partitioning needed.
B
Partition data by a frequently queried column, improves query performance and manageability.
C
Partition data randomly, no specific strategy.
D
Partition data only by date, regardless of query patterns.
No comments yet.