
Explanation:
Z-ordering (multi-dimensional clustering) is a technique used in Delta Lake to colocate related information in the same set of files. This data locality is particularly effective for optimizing filtering and query performance on non-partitioned columns, as it allows the engine to skip irrelevant data more effectively during query execution.
Ultimate access to all questions.
In the context of Delta Lake performance optimization, which of the following best describes what Z-ordering optimizes?
A
It enhances filtering efficiency specifically for partition keys.
B
It improves filtering and query performance for non-partitioned columns by colocating related data within files.
C
It is primarily used to reduce file sizes to increase parallelism across the cluster.
D
It optimizes shuffle operations when joining tables on high-cardinality columns.
No comments yet.