
Answer-first summary for fast verification
Answer: Data may be colocated by the cluster key within the micro-partitions to improve pruning performance
The correct answer is D because when a CLUSTER BY clause is defined in Snowflake, the immediate effect is that data is physically reorganized to colocate rows with similar cluster key values within the same micro-partitions. This improves pruning performance during queries by allowing Snowflake to skip entire micro-partitions that don't contain the requested data. Option A is incorrect because clustering keys don't hash data for searches - that's more related to indexing. Option B is incorrect because clustering doesn't specifically create larger micro-partitions for common values. Option C is incorrect because clustering doesn't create smaller micro-partitions to increase parallelism. The community discussion strongly supports D with 100% consensus and references to official Snowflake documentation confirming that clustering keys co-locate data in micro-partitions to improve pruning efficiency.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
When a CLUSTER BY clause is defined for a Snowflake table, what is the immediate effect on the physical data in the table?
A
Data is hashed by the cluster key to facilitate fast searches for common data values
B
Larger micro-partitions are created for common data values to reduce the number of partitions that must be scanned
C
Smaller micro-partitions are created for common data values to allow for more parallelism
D
Data may be colocated by the cluster key within the micro-partitions to improve pruning performance