
A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance.
Which keyword can be used to compact the small files?
A
OPTIMIZE
B
VACUUM
C
COMPACTION
D
REPARTITION
Explanation:
The correct answer is OPTIMIZE.
Running OPTIMIZE on a Delta table rewrites its small files into larger, more efficient files. REPARTITION is a Spark transformation that redistributes data across partitions, but OPTIMIZE is the dedicated command for file compaction in Delta Lake.
Use OPTIMIZE to improve query performance by compacting small files:
OPTIMIZE table_name compacts the small files of the whole table.
OPTIMIZE table_name WHERE, followed by a partition predicate, optimizes only specific partitions.
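As a rough illustration of the two forms of the command, the sketch below builds the OPTIMIZE statement as a string and hands it to whatever function runs SQL. The helper names (build_optimize_sql, compact), the table name events, and the partition column date are hypothetical and chosen only for this example. In a Spark session with Delta Lake available, the runner would typically be spark.sql.

```python
import re
from typing import Callable, Optional

# Conservative check for identifiers such as "events" or "db.events".
# This pattern is an assumption of the sketch, not a Delta Lake rule.
_IDENT = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*){0,2}$")


def build_optimize_sql(table_name: str, partition_filter: Optional[str] = None) -> str:
    """Return an OPTIMIZE statement, optionally limited by a partition predicate."""
    if not _IDENT.match(table_name):
        raise ValueError(f"unexpected table name: {table_name!r}")
    sql = f"OPTIMIZE {table_name}"
    if partition_filter:
        # The predicate is passed through unchanged; it should reference
        # partition columns of the table.
        sql += f" WHERE {partition_filter}"
    return sql


def compact(run_sql: Callable[[str], object], table_name: str,
            partition_filter: Optional[str] = None) -> object:
    """Build the statement and pass it to the supplied SQL runner."""
    return run_sql(build_optimize_sql(table_name, partition_filter))
```

With a live session, compact(spark.sql, "events") would compact the whole table, while compact(spark.sql, "events", "date = '2024-01-01'") would restrict compaction to the matching partition, assuming events is partitioned by date.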