Microsoft Azure Data Engineer Associate - DP-203

Get started today

Ultimate access to all questions.

You are tasked with optimizing the performance of a data processing system that handles large volumes of time-series data. The data needs to be partitioned to support efficient data retrieval and analysis. What partitioning strategy would you recommend, and how would you implement it in Azure Data Lake Storage Gen2?

Simulated

Implement a partition strategy based on the data's size, as this is the most important attribute for time-series data performance.

0.0%

Create a partition strategy based on the time interval of the data, such as hourly, daily, or monthly partitions, to improve query performance and reduce the amount of data scanned.

Comments

Loading comments...

Use a hash-based partitioning method to distribute the data evenly across multiple partitions, regardless of the data's characteristics.