
Ultimate access to all questions.
A dataset has been defined using Delta Live Tables and includes an expectations clause: CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION DROP ROW What is the expected behavior when a batch of data containing data that violates these constraints is processed?
A
Records that violate the expectation are dropped from the target dataset and loaded into a quarantine table.
B
Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.
C
Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.
D
Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.
E
Records that violate the expectation cause the job to fail.
Explanation:
When using Delta Live Tables with the ON VIOLATION DROP ROW clause, the expected behavior is:
ON VIOLATION FAIL UPDATE or other options, DROP ROW simply removes invalid records without creating separate quarantine tables.This behavior is documented in Databricks Delta Live Tables expectations documentation. The ON VIOLATION DROP ROW clause specifically means that violating rows are excluded from the target dataset but information about them is recorded in logs for observability.
Why other options are incorrect:
DROP ROW