
You need to implement file partitioning for a large dataset in your lakehouse to improve query performance. Describe the steps you would take to partition the data, including the selection of partition keys and the impact of this approach on storage and query efficiency.
A
Store all data in a single file; no partitioning is needed.
B
Partition data by a frequently queried column; this improves query performance and storage efficiency.
C
Partition data randomly, with no specific strategy.
D
Partition data only by date, regardless of query patterns.
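
To make the idea of a partition key concrete, here is a minimal sketch of partitioning by a frequently filtered column, using only the Python standard library. It is an illustration under stated assumptions, not a lakehouse implementation: the `events` rows, the `region` key, and the helpers `write_partitioned` and `read_partition` are all hypothetical, and CSV files stand in for the columnar files a real table would use. The directory layout `<key>=<value>/` follows the common Hive-style convention.

```python
import csv
import tempfile
from collections import defaultdict
from pathlib import Path


def write_partitioned(rows, root, key):
    """Write rows into Hive-style directories: <root>/<key>=<value>/part-0.csv."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row)
    for value, group in groups.items():
        part_dir = Path(root) / f"{key}={value}"
        part_dir.mkdir(parents=True, exist_ok=True)
        # The partition value is encoded in the directory name,
        # so the key column is not repeated inside the file.
        fields = [name for name in group[0] if name != key]
        with open(part_dir / "part-0.csv", "w", newline="") as fh:
            writer = csv.DictWriter(fh, fieldnames=fields, extrasaction="ignore")
            writer.writeheader()
            writer.writerows(group)


def read_partition(root, key, value):
    """Read only the directory that matches the filter (partition pruning)."""
    part_dir = Path(root) / f"{key}={value}"
    rows = []
    for path in sorted(part_dir.glob("*.csv")):
        with open(path, newline="") as fh:
            for row in csv.DictReader(fh):
                row[key] = value  # restore the key from the directory name
                rows.append(row)
    return rows


# Hypothetical sample data; "region" is assumed to be the column queries filter on.
events = [
    {"event_id": "1", "region": "eu", "amount": "10"},
    {"event_id": "2", "region": "us", "amount": "25"},
    {"event_id": "3", "region": "eu", "amount": "7"},
]

root = tempfile.mkdtemp()
write_partitioned(events, root, "region")

partition_dirs = sorted(p.name for p in Path(root).iterdir())
eu_rows = read_partition(root, "region", "eu")  # never opens the "us" files
```

A query that filters on `region` touches one directory instead of scanning every file, which is where the performance gain comes from. The storage side is a trade-off: a key with very many distinct values produces many small files, so a low-to-moderate cardinality column that matches the dominant query filters is usually the safer choice. In Spark, the equivalent write is typically expressed with `DataFrameWriter.partitionBy`.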