
Ultimate access to all questions.
You are tasked with optimizing the performance of a data processing system that handles large volumes of log data. The data needs to be partitioned to support efficient data retrieval and analysis. What partitioning strategy would you recommend, and how would you implement it in Azure Data Lake Storage Gen2?
A
Implement a partition strategy based on the log data's size, as this is the most important attribute for query performance.
B
Create a partition strategy based on the log data's timestamp, allowing for efficient querying of data within specific time ranges.
C
Use a hash-based partitioning method to distribute the data evenly across multiple partitions, regardless of the data's characteristics.
D
Do not implement any partition strategy, as the log data does not require optimization for query performance.