
Ultimate access to all questions.
You are tasked with designing a Delta Lake for storing streaming event data ingested via Databricks, with expectations of significant data growth. What strategy would you employ to optimize the Delta Lake table for both query performance and efficient data management?
A
Store all data in a single partition to simplify data management and rely on Delta Lake‘s automatic optimization.
B
Partition the data by event type and ensure all queries include a filter on this partition key.
C
Use a single column for partitioning and avoid Z-ordering to keep the data model simple.
D
Partition the data by event timestamp at a daily granularity and employ Z-ordering on frequently queried columns.