
Explanation:
To minimize daily data transfer between Azure Blob Storage and Azure Data Lake Storage Gen2, the solution must focus on incremental loading - transferring only new or modified files each day rather than all files.
A: Specify a file naming pattern for the destination
C: Filter by the last modified date of the source files
B: Delete the files in the destination before loading the data
D: Delete the source files after they are copied
The combination of file naming patterns (A) and date filtering (C) creates an efficient incremental loading solution:
Ultimate access to all questions.
No comments yet.
You are designing a solution to copy daily Parquet files from an Azure Blob Storage account to an Azure Data Lake Storage Gen2 account. The destination folder structure is {Year}/{Month}/{Day}/. The goal is to minimize the daily data transfer between the two accounts.
Which two configurations should you include in the design of the Azure Data Factory data load? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
A
Specify a file naming pattern for the destination.
B
Delete the files in the destination before loading the data.
C
Filter by the last modified date of the source files.
D
Delete the source files after they are copied.