
Explanation:
The correct answer is D.
By default, Databricks Auto Loader operates in Directory Listing mode. In this mode, Auto Loader identifies new files by periodically scanning the input directory and tracking the processed files in a checkpoint. This allows for incremental and idempotent ingestion (ensuring each file is processed exactly once) without requiring complex cloud-native configurations.
cloudFiles.useNotifications to true).Ultimate access to all questions.
No comments yet.
Which statement accurately describes the default execution mode used by Databricks Auto Loader to detect and process new files?
A
It utilizes cloud-specific notification services (e.g., AWS SNS/SQS or Azure Event Grid) and queue storage to track and incrementally load new files.
B
It relies on webhooks to automatically trigger Databricks jobs whenever new files arrive, merging data into target tables using schema inference.
C
It performs a full scan of the source directory during every trigger, recreating the target Delta table by querying all valid files found in the source.
D
It scans the input directory to identify new files, loading them incrementally and idempotently into the target Delta Lake table.