Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
When designing a data model for multi-dimensional data analysis (e.g., OLAP cubes) in a lakehouse, which approach ensures optimal query performance across various aggregation queries?
A
Store raw event data in a flat structure, applying on-the-fly aggregation during each query.
B
Implement a snowflake schema, normalizing dimensions to reduce data redundancy and storage costs.
C
Pre-aggregate data into a set of materialized views covering common query patterns.
D
Utilize a star schema with fact tables and dimension tables, leveraging Delta Lake‘s performance features.