
Answer-first summary for fast verification
Answer: WarehouseID
## Detailed Explanation ### Understanding Partitioning in Azure Synapse Analytics Partitioning in Azure Synapse Analytics dedicated SQL pools is a critical design decision that affects query performance through **partition elimination**. When a query includes the partition key in the WHERE clause, the query optimizer can eliminate entire partitions from the scan, significantly reducing I/O and improving performance. ### Analysis of Usage Patterns The question specifies two key usage patterns: 1. **Primary pattern**: "Analysts will most commonly analyze transactions for a warehouse" 2. **Secondary patterns**: Queries will summarize by product category type, date, and/or inventory event type ### Evaluation of Partition Column Options **A. EventTypeID** - Not optimal because: - Event types are typically low cardinality (few distinct values) - Not mentioned as the primary filtering criteria - Would create too few partitions for effective elimination **B. ProductCategoryTypeID** - Not optimal because: - Similar to EventTypeID, likely low cardinality - Not the primary filtering pattern - Would not align with the most common query pattern **C. EventDate** - Suboptimal because: - While date partitioning is common in data warehousing, it doesn't align with the primary usage pattern - The question emphasizes that warehouse-based analysis is the most common scenario - Partitioning by date would not provide partition elimination for warehouse-specific queries **D. WarehouseID** - **OPTIMAL CHOICE** because: - Directly aligns with the primary usage pattern: "most commonly analyze transactions for a warehouse" - WarehouseID will frequently appear in WHERE clauses, enabling partition elimination - Provides effective data organization for the most common analytical scenarios - Supports the secondary summarization patterns (by product category, date, event type) within warehouse contexts ### Partitioning Best Practices - **Partition elimination** is the primary performance benefit of partitioning - The partition column should be the most common filter in WHERE clauses - For clustered columnstore tables, ensure sufficient rows per partition (minimum 1 million recommended) - Partitioning should align with the dominant query patterns ### Conclusion WarehouseID is the optimal partition column because it directly supports the primary analytical pattern where analysts most commonly query by warehouse. This enables effective partition elimination and minimizes query times for the most frequent use case, while still supporting the secondary summarization requirements.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are designing an inventory updates table with a clustered columnstore index in an Azure Synapse Analytics dedicated SQL pool. The table has the following columns:
The identified usage patterns are:
To minimize query times, on which column should you partition the table?

A
EventTypeID
B
ProductCategoryTypeID
C
EventDate
D
WarehouseID
No comments yet.