
Explanation:
Correct Answer: B
By default, when data is written into a Delta table, Delta Lake collects file-level statistics—such as minimum, maximum, null counts, and row counts—for the first 32 columns defined in the table schema. During query execution, the Delta engine leverages these statistics to perform data skipping, bypassing Parquet files whose values fall entirely outside the query's filter predicates. This significantly reduces I/O and accelerates query performance.
Why the other options are incorrect:
Ultimate access to all questions.
Which of the following statements regarding Delta Lake features and behaviors is true?
A
Delta Lake enforces primary and foreign key constraints to prevent duplicate records and maintain referential integrity within dimension tables.
B
Delta Lake automatically collects statistics on the first 32 columns of each table, which are utilized for data skipping based on query filters.
C
Z-ordering in Delta Lake is a data organization technique that is strictly limited to numeric columns.
D
Standard views in the Lakehouse always maintain an up-to-date, physical cache of the latest versions of their source tables to optimize performance.
No comments yet.