Microsoft Azure Data Engineer Associate - DP-203

Ultimate access to all questions.

You are designing a data pipeline for a financial services company that requires processing large volumes of transactional data. The data needs to be partitioned to optimize query performance and support efficient data retrieval. What partitioning strategy would you recommend, and how would you implement it in Azure Data Lake Storage Gen2?

Simulated

Implement a partition strategy based on the transaction amount, as it is the most important attribute for query performance.

12.5%

Create a partition strategy based on the transaction date and time, allowing for efficient querying of data within specific time ranges.

Loading comments...

Use a round-robin partitioning method to distribute the data evenly across multiple partitions, regardless of the data's characteristics.

12.5%