
Ultimate access to all questions.
You are designing a data model in a Databricks lakehouse to support multi-dimensional analytics for a retail company. How would you structure the data to optimize for query performance across numerous dimensions such as time, geography, product, and customer demographics?
A
Store all data in a single, flattened wide table to minimize join operations during query execution.
B
Normalize the data into multiple related tables to reduce redundancy and save on storage costs.
C
Implement a star schema design with dimension tables for time, geography, product, and customer demographics, centering around a sales fact table.
D
Use a snowflake schema but avoid creating dimension tables for highly hierarchical dimensions like geography.