
Answer-first summary for fast verification
Answer: Utilize Delta table constraints to enforce data quality rules.
The optimal solution for ensuring data quality in Delta tables is to utilize Delta table constraints. This approach enforces data quality rules directly within the Delta table schema, automatically rejecting any data that violates these constraints during write operations. This method is both efficient and automated, providing a robust solution for maintaining data integrity. Other options, such as using the UPDATE statement or exporting invalid data for manual review, are either reactive or add unnecessary complexity without guaranteeing data quality at the source.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
A data engineering team aims to maintain high data quality in Databricks Delta tables by ensuring only valid data is written, rejecting any that fails to meet validation criteria. Which Databricks feature or option can best help achieve this goal?
A
Export the invalid data to a separate file for manual review.
B
Use the UPDATE statement to modify invalid data.
C
Utilize Delta table constraints to enforce data quality rules.
D
Implement a Databricks Job to perform data validation before writing to Delta tables.
No comments yet.