
Answer-first summary for fast verification
Answer: delta.dataSkippingNumIndexedCols
Data Skipping enhances query processing speed in Delta tables. The default setting for `delta.dataSkippingNumIndexedCols` is 32, limiting statistics collection to the first 32 columns. Adjusting this value upwards increases the number of columns for which statistics are gathered but also raises the overhead during data writes. More details can be found in the documentation on Data Skipping using Z-order.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
To ensure statistics are collected for the first 50 columns of a Delta table with 73 columns, which property needs to be modified to affect future appends?
A
delta.dataSkippingCols
B
The default value is 50; no need to change any property.
C
spark.sql.delta.dataSkippingNumIndexedCols
D
delta.dataSkippingNumIndexedCols
E
spark.sql.dataSkippingNumIndexedCols
No comments yet.