
Ultimate access to all questions.
Given a dataset of weather readings from various stations, which includes columns like station_id, temperature, humidity, and timestamp. Describe how you would apply Delta Lake optimizations such as partitioning, z-ordering, and bloom filters to this dataset to enhance query performance. Consider the typical query patterns and the size of the dataset.
A
Partition by station_id, z-order by timestamp, apply bloom filters on temperature.
B
Partition by temperature, z-order by station_id, apply bloom filters on humidity.
C
Partition by timestamp, z-order by temperature, apply bloom filters on station_id.
D
Partition by humidity, z-order by timestamp, apply bloom filters on temperature.