
Explanation:
Delta Lake automatically collects metadata—including minimum and maximum values—for the first 32 columns defined in a table's schema. This metadata is leveraged during data skipping, an optimization that allows the engine to skip reading entire files that do not contain relevant data based on query filters.
Key clarifications:
MERGE or other logic.Ultimate access to all questions.
Which statement accurately describes the features of Delta Lake and the Lakehouse architecture?
A
Parquet utilizes row-by-row compression, meaning that string data is only compressed when specific characters are repeated multiple times.
B
Primary and foreign key constraints are strictly enforced by the Delta engine to prevent the insertion of duplicate values into dimension tables.
C
Delta Lake automatically gathers statistics on the first 32 columns of each table, which are used for data skipping to optimize query performance.
D
Views in the Lakehouse architecture maintain a continuous, valid cache of the most recent versions of all underlying source tables.
No comments yet.