Databricks Certified Data Engineer - Professional Quiz - LeetQuiz

A Spark ETL pipeline processes data nightly. One stage requires identifying new records within a Delta Lake table named `bronze` that have not yet been processed downstream. This table is partitioned by `year`, `month`, and `day`. Which of the following designs for the `new_records` function effectively returns a Spark DataFrame containing only the unprocessed data from the `bronze` table?
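For orientation, here is one possible shape for such a function, offered as a minimal sketch rather than as the quiz's intended answer. It assumes the partition columns `year`, `month`, and `day` are integers, and that the pipeline tracks the last processed date somewhere (the `last_processed` parameter and the helper `unprocessed_partition_predicate` are hypothetical names, not part of the question). The filter is written directly against the partition columns so that Spark can skip partitions that were already handled.

```python
from datetime import date


def unprocessed_partition_predicate(last_processed: date) -> str:
    """Build a SQL predicate matching partitions strictly after `last_processed`.

    Assumes integer partition columns named year, month, and day.
    """
    y, m, d = last_processed.year, last_processed.month, last_processed.day
    return (
        f"(year > {y}) "
        f"OR (year = {y} AND month > {m}) "
        f"OR (year = {y} AND month = {m} AND day > {d})"
    )


def new_records(spark, last_processed: date):
    """Return a batch DataFrame of `bronze` rows in partitions after `last_processed`.

    `spark` is expected to be an active SparkSession; how `last_processed`
    is stored and updated is left to the pipeline (an assumption here).
    """
    predicate = unprocessed_partition_predicate(last_processed)
    # Filtering on partition columns lets Spark prune already-processed partitions.
    return spark.read.table("bronze").where(predicate)
```

A call such as `new_records(spark, date(2024, 1, 15))` would read only partitions dated after 15 January 2024. Another commonly used design reads the table incrementally with `spark.readStream.table("bronze")` and relies on a streaming checkpoint to track progress; which design the quiz expects depends on answer options not shown here.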