Databricks Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

Explanation:

Auto Loader is designed to incrementally and efficiently process new data files as they arrive in cloud storage. It can detect new files since the previous run and ingest only those files, making it suitable for this scenario where files accumulate in a shared directory and only new files need to be ingested with each run.

Explanation:

Comments (0)

No comments yet.

A data engineer is tasked with creating an efficient data pipeline. The source system continuously generates files in a shared directory that is utilized by multiple processes. Consequently, the files should remain unchanged and will accumulate in this directory over time. The data engineer must determine which files have been newly added since the last pipeline run and configure the pipeline to exclusively ingest these new files in every subsequent run. Which of the following tools can the data engineer use to address this requirement?

Exam-Like

Unity Catalog

6.7%

Delta Lake

8.2%

Databricks SQL

4.0%

Data Explorer

7.7%

Auto Loader

73.4%