
Ultimate access to all questions.
You are designing a data processing system for a healthcare organization that requires analyzing patient records. The data needs to be partitioned to optimize query performance and support efficient data retrieval. What partitioning strategy would you recommend, and how would you implement it in Azure Data Lake Storage Gen2?
A
Implement a partition strategy based on the patient's age, as it is the most important attribute for query performance.
B
Create a partition strategy based on the patient's medical record ID, allowing for efficient querying of data within specific patient records.
C
Use a hash-based partitioning method to distribute the data evenly across multiple partitions, regardless of the data's characteristics.
D
Do not implement any partition strategy, as the patient record data does not require optimization for query performance.