
Ultimate access to all questions.
A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance.
Which keyword can be used to compact the small files?
A
OPTIMIZE
B
VACUUM
C
COMPACTION
D
REPARTITION
Explanation:
The correct answer is A. OPTIMIZE.
Why OPTIMIZE is correct:
OPTIMIZE command is specifically designed to compact small files into larger files to improve read performance.OPTIMIZE reorganizes the data layout by merging small files into larger, more efficient files while maintaining data consistency.Why other options are incorrect:
OPTIMIZE.OPTIMIZE is the dedicated command for this purpose in Delta Lake.Example usage:
OPTIMIZE table_name
OPTIMIZE table_name
Additional notes:
OPTIMIZE can be run on the entire table or on specific partitions.