
Explanation:
The correct answer is B. The OPTIMIZE command in Databricks Delta Lake is used to compact small files into larger ones, thus improving performance. This process is known as file compaction and helps to enhance query performance by reducing the number of files that the engine needs to scan.
Ultimate access to all questions.
A data engineer working with a Delta table has observed that the associated data files are extremely small, leading to potential performance issues. To enhance performance by compacting these small files into larger ones, which of the following keywords should the data engineer use?
A
REDUCE
B
OPTIMIZE
C
COMPACTION
D
REPARTITION
E
VACUUM
No comments yet.