
Answer-first summary for fast verification
Answer: Use a batch processing approach with Delta Lake to handle incremental processing and deduplication.
Option A is correct because Delta Lake supports batch workloads directly: MERGE INTO handles deduplication by upserting only new or changed records, and features such as auto-compaction keep the table efficient as incremental batches are appended. Option B prescribes a streaming architecture for what is stated to be a batch requirement; Option C ignores the deduplication requirement entirely; Option D gives up Delta Lake's ACID transactions and schema enforcement for no benefit.
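The MERGE INTO pattern named above can be sketched in Delta Lake SQL. This is a minimal illustration, assuming a Delta table `transactions` keyed by `txn_id` and an incoming batch staged as a view or table `new_batch`; these names are hypothetical.

```sql
-- Upsert the incoming batch into the Delta table, deduplicating on txn_id:
-- rows already present are updated in place, unseen rows are inserted.
MERGE INTO transactions AS t
USING new_batch AS s
  ON t.txn_id = s.txn_id
WHEN MATCHED THEN
  UPDATE SET *
WHEN NOT MATCHED THEN
  INSERT *
```

Because MERGE runs as a single ACID transaction, re-running the same batch is idempotent: already-merged records are simply updated to identical values rather than duplicated.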
Author: LeetQuiz Editorial Team
You are tasked with implementing a data pipeline that processes batch data from a financial institution. The data needs to be processed incrementally and deduplicated. Describe the architecture and operations necessary to achieve this using Delta Lake with batch workloads.
A
Use a batch processing approach with Delta Lake to handle incremental processing and deduplication.
B
Implement a streaming architecture with Delta Lake using MERGE INTO for deduplication and auto-compaction for incremental processing.
C
Use a combination of Kafka and Delta Lake for batch data without handling deduplication.
D
Implement a custom Python script to handle batch data and deduplication without using Delta Lake.
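The auto-compaction behavior referenced in the explanation can be enabled through table properties, so small files produced by frequent incremental batch appends are coalesced automatically. A minimal sketch, assuming a Databricks-style Delta table named `transactions` (hypothetical):

```sql
-- Enable auto-compaction and optimized writes on the target Delta table
-- so incremental batch appends do not accumulate many small files.
ALTER TABLE transactions SET TBLPROPERTIES (
  'delta.autoOptimize.autoCompact'   = 'true',
  'delta.autoOptimize.optimizeWrite' = 'true'
);
```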