
Answer-first summary for fast verification
Answer: Utilizing the DESCRIBE HISTORY command to identify the corrupted version and the RESTORE command to revert to a previous version
1. **DESCRIBE HISTORY Command**: This command in Delta Lake provides a history of all operations performed on a Delta table, enabling identification of the version where corruption occurred. 2. **Identify the Corrupted Version**: By examining the history, you can determine the exact moment the erroneous data ingestion job was executed. 3. **RESTORE Command**: This feature allows reverting the Delta table to a previous version, specifically to a state before the corruption happened. 4. **Efficient Solution**: The combination of DESCRIBE HISTORY and RESTORE commands offers a straightforward method to address data corruption, ensuring data integrity without manual corrections or complex audits. Option B stands out as the most effective approach for leveraging Delta Lake's time travel feature to restore a table's integrity post-corruption.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
In the event of data corruption in a Delta Lake table caused by a faulty data ingestion job, how can Delta Lake's time travel feature in Databricks be utilized to pinpoint and revert the table to its pre-corruption state?
A
Applying the VERSION AS OF option in a Delta table read operation to audit data changes and manually correct the corrupted data
B
Utilizing the DESCRIBE HISTORY command to identify the corrupted version and the RESTORE command to revert to a previous version
C
Implementing a Spark SQL query with the TIMESTAMP AS OF clause to access historical data and identify discrepancies
D
Leveraging MLflow to track data versioning and rollback the Delta table to a stable state before the corruption