
Answer-first summary for fast verification
Answer: Ingesting all raw data and metadata into a **Bronze** Delta table to create a permanent, replayable history of the data state.
The most effective way to prevent this scenario in a Databricks environment is to implement a **Bronze table** as part of a Medallion Architecture.

* **Why A is correct:** By capturing the raw, unprocessed data (including all fields and metadata) directly from the source into a Delta table, you create a permanent, timestamped history. Because Delta Lake uses immutable Parquet files and a transaction log, the data remains available long after the source system's (Kafka's) retention period expires, so engineers can re-process or 'replay' it to recover missing fields at any time.
* **Why B is incorrect:** Schema evolution allows new columns to be added to a table's schema during ingestion, but it cannot retroactively retrieve data that was never written to the table in the first place.
* **Why C is incorrect:** Delta Lake stores exactly what the Spark DataFrame provides. It does not automatically discover and ingest fields unless the ingestion logic is specifically configured to capture the raw payload (e.g., as a JSON blob).
* **Why D is incorrect:** Structured Streaming checkpoints and the Delta transaction log track the state of the current pipeline and table transactions; they do not archive data from the source producer once it has been purged from Kafka.
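The Bronze-replay pattern behind answer A can be sketched in plain Python, with no Spark cluster required. This is a minimal illustration, not a real pipeline: `ingest_bronze`, `build_silver`, and the in-memory `bronze` list are hypothetical stand-ins for a Kafka-to-Delta Structured Streaming job. The key point it demonstrates is that because Bronze lands the untouched payload, a field the original Silver logic ignored can still be recovered later by replaying Bronze:

```python
import json

# Simplified sketch of the Bronze-replay idea. In a real Databricks
# pipeline, `bronze` would be an append-only Bronze Delta table fed by
# Structured Streaming from Kafka; these names are illustrative only.

bronze = []  # stands in for the Bronze Delta table

def ingest_bronze(raw_record: bytes, kafka_offset: int) -> None:
    """Land the untouched payload plus source metadata; parse nothing."""
    bronze.append({"offset": kafka_offset, "value": raw_record.decode("utf-8")})

def build_silver(fields):
    """(Re)build a curated view by re-parsing the stored raw payloads."""
    return [
        {f: json.loads(row["value"]).get(f) for f in fields}
        for row in bronze
    ]

# Day 1: the Silver logic only extracts `user_id`; `country` is ignored.
ingest_bronze(b'{"user_id": 1, "country": "DE"}', kafka_offset=100)
silver_v1 = build_silver(["user_id"])

# Months later, long after Kafka's 7-day retention has purged the record:
# replay Bronze to recover the omitted `country` field.
silver_v2 = build_silver(["user_id", "country"])
```

Had only the parsed Silver rows been stored, `country` would be unrecoverable once Kafka expired the source records; storing the raw payload in Bronze is what makes the replay possible.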
Author: LeetQuiz Editorial Team
A data engineer discovers that a critical field from a Kafka source was omitted during the ingestion process into a Delta Lake pipeline. While the field was present in the Kafka source, it is now missing from downstream storage. Kafka has a seven-day retention period, but the pipeline has been running for three months.
How can Delta Lake be used to prevent this type of permanent data loss in the future?
A
Ingesting all raw data and metadata into a Bronze Delta table to create a permanent, replayable history of the data state.
B
Utilizing Delta Lake schema evolution to retroactively compute values for newly added fields based on original source properties.
C
Configuring Delta Lake to automatically include every field from the source data in the ingestion layer by default.
D
Relying on the Delta log and Structured Streaming checkpoints, which maintain a complete history of the Kafka producer’s records.