
Explanation:
Option A is the correct answer because maintaining data quality rules separately from the pipeline follows the best practice of separation of concerns and enables reusability across multiple tables and pipelines.
This approach aligns with Databricks best practices for data quality management in CI/CD workflows.
Ultimate access to all questions.
No comments yet.
Databricks CI/CD Workflows
A data engineer needs to apply a common set of data quality rules to multiple tables. Which of the following best practices can they follow to do this? Select one response.
A
Maintain data quality rules separately from the pipeline
B
Create a separate pipeline containing the data quality rules and run it concurrently with the pipeline
C
Tag the dataset used to populate the tables in the pipeline with data quality rule definitions
D
Create a task in Workflows for data quality rules and make it a dependency of the pipeline