
Ultimate access to all questions.
A Delta table containing weather records is partitioned by the date column. The schema includes date, device_id, temp, latitude, and longitude. A data engineer executes a query to retrieve records from the Arctic Circle using the filter latitude > 66.3.
What mechanism does the Delta engine use to identify which specific Parquet files must be loaded to satisfy this query?_
A
The Delta engine scans the Delta transaction log (_delta_log) to retrieve the minimum and maximum statistics for the latitude column.
B
The Delta engine scans the footers of all Parquet files in the table to extract the minimum and maximum statistics for the latitude column.
C
The Delta engine retrieves the minimum and maximum statistics for the latitude column from the Hive metastore's partition metadata.
D
All records are first cached to the cluster's attached storage, and the filter condition is subsequently applied to the cached data.
E
All records are first loaded into an operational database where the filter is applied as a post-processing step.