
A dataset has been defined using Delta Live Tables and includes an expectations clause:
CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION DROP ROW
What is the expected behavior when a batch containing records that violate this constraint is processed?
A
Records that violate the expectation are dropped from the target dataset and loaded into a quarantine table.
B
Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.
C
Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.
D
Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.
E
Records that violate the expectation cause the job to fail.
Explanation:
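In Delta Live Tables (DLT), when you define a constraint with ON VIOLATION DROP ROW, records that fail the expectation are dropped before they are written to the target dataset, and the violations are recorded as data quality metrics in the pipeline's event log. The correct answer is C.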
Let's break down the options:
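A: Incorrect. DLT does not load dropped records into a quarantine table automatically; a quarantine table has to be built as a separate dataset.
B: Incorrect. No expectation mode adds a flag field to the target dataset, and DROP ROW does not keep invalid records.
C: Correct. This is the ON VIOLATION DROP ROW behavior: dropped from target dataset and recorded in event log.
D: Incorrect. This is the default behavior when no ON VIOLATION clause is specified, not DROP ROW.
E: Incorrect. This is the ON VIOLATION FAIL UPDATE behavior, not DROP ROW.
Key DLT Constraint Behaviors: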
ON VIOLATION DROP ROW: Drop violating rows, record in event log
ON VIOLATION FAIL UPDATE: Fail the pipeline update as soon as a record violates the expectation
No ON VIOLATION clause: Violating rows are kept in the target dataset, and the violations are recorded in the event log
The DROP ROW behavior allows data quality monitoring while ensuring only valid data enters the target dataset.
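The three constraint behaviors can be compared with a small plain-Python model. This is only an illustrative sketch, not the DLT API: apply_expectation, UpdateResult, ExpectationFailed, and the passed_records / failed_records keys are hypothetical names chosen for this example, and the dictionary is a toy stand-in for the pipeline event log.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Callable, Dict, List, Optional

Record = Dict[str, str]


class ExpectationFailed(Exception):
    """Models an update that aborts under ON VIOLATION FAIL UPDATE."""


@dataclass
class UpdateResult:
    # Rows that would be written to the target dataset.
    target: List[Record] = field(default_factory=list)
    # Toy stand-in for the data quality metrics in the event log.
    event_log: Dict[str, int] = field(default_factory=dict)


def apply_expectation(
    rows: List[Record],
    predicate: Callable[[Record], bool],
    on_violation: Optional[str] = None,
) -> UpdateResult:
    """Simulate one expectation; on_violation=None models an omitted clause."""
    if on_violation not in (None, "DROP ROW", "FAIL UPDATE"):
        raise ValueError(f"unknown ON VIOLATION action: {on_violation}")

    result = UpdateResult()
    failed = 0
    for row in rows:
        if predicate(row):
            result.target.append(row)
            continue
        failed += 1
        if on_violation == "FAIL UPDATE":
            # The first invalid record aborts the whole update.
            raise ExpectationFailed(f"record violates expectation: {row}")
        if on_violation is None:
            # Default: the invalid record is still written to the target.
            result.target.append(row)
        # DROP ROW: the invalid record is simply not written.

    result.event_log = {
        "passed_records": len(rows) - failed,
        "failed_records": failed,
    }
    return result


def valid_timestamp(row: Record) -> bool:
    # Mirrors EXPECT (timestamp > '2020-01-01') from the question.
    return datetime.fromisoformat(row["timestamp"]) > datetime(2020, 1, 1)


batch = [
    {"id": "1", "timestamp": "2019-06-30T00:00:00"},  # violates the constraint
    {"id": "2", "timestamp": "2021-03-15T12:00:00"},  # satisfies the constraint
]

dropped = apply_expectation(batch, valid_timestamp, "DROP ROW")
kept = apply_expectation(batch, valid_timestamp)
```

In this model, dropped.target contains only record 2 while kept.target contains both records, and both results report one failed record. Calling apply_expectation(batch, valid_timestamp, "FAIL UPDATE") raises ExpectationFailed instead of returning a result, which corresponds to option E.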