
Ultimate access to all questions.
Given a Delta table named ‘products’ with the schema: name (STRING), category (STRING), expiration_date (DATE), price (FLOAT). When executing the query SELECT * FROM products WHERE price > 30.5, which of the following mechanisms will the query optimizer use to identify the data files to load?*_
A
Columns statistics in the Hive metastore
B
Files statistics in the Delta transaction log
C
Columns statistics in the metadata of Parquet files
D
Files statistics in the Hive metastore
E
None of the above. All data files are fully scanned to identify the ones to load