
When designing a data pipeline for a financial services company that ingests high-volume transactions from multiple sources into a Delta Lake table, which of the following approaches BEST ensures data quality and consistency while meeting regulatory compliance and scalability requirements? (Choose one option)
A. Relying solely on a single source of truth and minimizing data duplication without implementing any additional data quality checks.
B. Incorporating comprehensive data quality checks and validation logic at each processing stage, including schema validation, null checks, and custom business rule validations.
C. Utilizing Delta Lake's ACID transactions and MERGE INTO statements for upserts to ensure data consistency, without explicit data quality checks.
D. Denormalizing all data and eliminating lookup tables to simplify the pipeline, assuming this will inherently ensure data quality and consistency.
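
To make the validations described in option B concrete, here is a minimal PySpark sketch. The table paths, column names, and allowed-currency list are hypothetical placeholders, not part of the question; they simply stand in for whatever the real transaction feed defines.

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import (
    StructType, StructField, StringType, DecimalType, TimestampType,
)

# Assumes the delta-spark package is available (e.g. pip install delta-spark).
spark = (
    SparkSession.builder.appName("txn-quality-checks")
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog")
    .getOrCreate()
)

# Schema validation: enforce an explicit schema on the raw feed.
# (Columns are hypothetical stand-ins for the real transaction layout;
# Spark does not enforce nullability on read, hence the null checks below.)
expected_schema = StructType([
    StructField("txn_id", StringType(), nullable=False),
    StructField("account_id", StringType(), nullable=False),
    StructField("amount", DecimalType(18, 2), nullable=False),
    StructField("currency", StringType(), nullable=False),
    StructField("txn_ts", TimestampType(), nullable=False),
])

raw = (
    spark.read.format("json")
    .schema(expected_schema)
    .load("/mnt/raw/transactions/")  # hypothetical landing path
)

# Null checks: drop records missing any mandatory field.
required = ["txn_id", "account_id", "amount", "currency", "txn_ts"]
non_null = raw.dropna(subset=required)

# Custom business rules: e.g. positive amounts and an allowed currency list.
valid = non_null.filter(
    (F.col("amount") > 0) & F.col("currency").isin("USD", "EUR", "GBP")
)

# Quarantine rejected rows instead of silently dropping them.
rejected = raw.subtract(valid)
rejected.write.format("delta").mode("append").save("/mnt/quarantine/transactions/")

# Only validated records reach the curated Delta table.
valid.write.format("delta").mode("append").save("/mnt/curated/transactions/")
```

Writing rejected rows to a quarantine table rather than discarding them keeps an audit trail of what was filtered out at each stage, which is the kind of traceability the question's regulatory-compliance framing calls for.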
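For comparison, option C's mechanism on its own is sketched below using the delta-spark Python API; the merge key `txn_id` and the staging/target paths are assumptions for illustration. MERGE INTO gives atomic upserts backed by Delta Lake's ACID transactions, but, as the option itself states, it does not by itself validate what gets merged.

```python
from delta.tables import DeltaTable
from pyspark.sql import SparkSession

# Assumes the delta-spark package is installed and configured as above.
spark = (
    SparkSession.builder.appName("txn-upserts")
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog")
    .getOrCreate()
)

# Hypothetical staging batch of incoming transactions.
updates = spark.read.format("delta").load("/mnt/staging/transactions/")

target = DeltaTable.forPath(spark, "/mnt/curated/transactions/")

# MERGE INTO: atomically update existing transactions and insert new ones,
# relying on Delta Lake's ACID transaction guarantees for consistency.
(
    target.alias("t")
    .merge(updates.alias("s"), "t.txn_id = s.txn_id")
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute()
)
```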