
Answer-first summary for fast verification
Answer: OPTIMIZE
The correct answer is B. The OPTIMIZE command in Databricks Delta Lake compacts small data files into larger ones. This file compaction improves query performance because the engine has fewer files to open and scan. VACUUM, the most plausible distractor, does not compact anything: it deletes data files that the table no longer references and that are older than the retention threshold.
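As a rough illustration of the command described above, the following sketch assembles an OPTIMIZE statement as a string. The helper name `build_optimize_sql` and the table and column names (`sales.events`, `event_date`, `user_id`) are hypothetical and not part of the question. The optional WHERE and ZORDER BY clauses are part of the Delta Lake OPTIMIZE syntax; on Databricks the resulting string would typically be run with `spark.sql(...)`, which is assumed here rather than executed.

```python
from typing import Optional, Sequence


def build_optimize_sql(table: str,
                       where: Optional[str] = None,
                       zorder_by: Sequence[str] = ()) -> str:
    """Return a Delta Lake OPTIMIZE statement for `table` (hypothetical helper)."""
    parts = [f"OPTIMIZE {table}"]
    if where:
        # Restricts compaction to a subset of the table, e.g. recent partitions.
        parts.append(f"WHERE {where}")
    if zorder_by:
        # Optionally co-locates related rows while compacting.
        parts.append(f"ZORDER BY ({', '.join(zorder_by)})")
    return " ".join(parts)


if __name__ == "__main__":
    # Compact the whole table (table name is an assumption for illustration).
    print(build_optimize_sql("sales.events"))
    # Compact only selected data and Z-order by a frequently filtered column.
    print(build_optimize_sql("sales.events",
                             where="event_date >= '2024-01-01'",
                             zorder_by=["user_id"]))
```

Building the statement as a plain string keeps the sketch runnable without a Spark session; in a notebook, the same text could be issued directly as a SQL cell.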
Author: LeetQuiz Editorial Team
A data engineer working with a Delta table has observed that the associated data files are extremely small, leading to potential performance issues. To enhance performance by compacting these small files into larger ones, which of the following keywords should the data engineer use?
A. REDUCE
B. OPTIMIZE
C. COMPACTION
D. REPARTITION
E. VACUUM