
Explanation:
The OPTIMIZE command in Delta Lake is specifically designed for consolidating small files into larger ones. This reorganization enhances query performance by minimizing the number of files that need to be accessed, while also optimizing storage efficiency. Unlike other options, OPTIMIZE directly tackles the issue of small file consolidation without the explicit deletion of data, making it the optimal solution for improving table performance in Delta Lake environments.
Ultimate access to all questions.
A data engineer observes that the data files associated with a Delta table are unusually small, resulting in suboptimal performance. To improve query efficiency, they plan to merge these small files into fewer, larger ones. Which keyword should be employed to accomplish this file consolidation?
A
VACUUM
B
REPARTITION
C
OPTIMIZE
D
COMPACTION
E
REDUCE
No comments yet.