Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
You are optimizing a lakehouse for complex time-based queries against a decade of historical data, with a focus on ensuring optimal performance for queries on recent data (up to one month old). How should you structure your partitions?
A
Use a flat partitioning scheme based on ingestion time, relying on the lakehouse's automatic optimization features to handle query performance.
B
Create separate tables for historical and recent data, with recent data partitioned by day and historical data by year, and use a view to unify them for querying.
C
Partition data by year, then by month, applying Z-ordering on the most queried columns within the recent data partition.
D
Implement a hybrid partitioning scheme that partitions recent data daily and historical data monthly to optimize for query access patterns.