Ultimate access to all questions.
In a scenario where you are managing a Delta Lake table that has accumulated a large number of small files due to frequent updates and deletes, you need to optimize both storage efficiency and query performance. Considering the constraints of minimizing operational overhead and ensuring compliance with data retention policies, which of the following actions should you take? Additionally, what is the primary benefit of using the OPTIMIZE
command in this context? Choose the best option that describes the role of the OPTIMIZE
command and the type of files it affects. (Choose one correct answer)