
Answer-first summary for fast verification
Answer: The number of data files that are processed in parallel is determined by the virtual warehouse.
Option D is correct because the number of data files that Snowflake processes in parallel during a load is determined by the amount of compute resources in the virtual warehouse. Larger warehouses can process more files simultaneously, which speeds up loads made up of many files. This matches the community consensus (91% selected D, with multiple upvoted comments explaining that warehouse resources dictate parallelism). Option A is incorrect because Snowflake recommends data files of roughly 100-250 MB (or larger) compressed; staging many very small files adds per-file processing overhead. Option B is incorrect because the goal of file sizing is to minimize, not maximize, the processing overhead for each file, and excessively large files can reduce parallelism and efficiency. Option C is incorrect because the number of load operations that run in parallel cannot exceed the number of data files to be loaded.
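The relationship between file count and warehouse capacity can be illustrated with a small sketch. This is only a simplified model, not Snowflake's actual scheduler: `warehouse_slots` is a hypothetical stand-in for however many files a given warehouse can process at once, a number Snowflake does not expose as a setting.

```python
import math


def effective_parallelism(num_files: int, warehouse_slots: int) -> int:
    """Files that can be loaded at the same time in this simplified model.

    Parallelism is capped by the warehouse (warehouse_slots, a hypothetical
    capacity figure) and can never exceed the number of files to load.
    """
    if num_files < 0 or warehouse_slots < 0:
        raise ValueError("num_files and warehouse_slots must be non-negative")
    return min(num_files, warehouse_slots)


def load_waves(num_files: int, warehouse_slots: int) -> int:
    """Rough number of sequential 'waves' needed to load every file."""
    if warehouse_slots <= 0:
        raise ValueError("warehouse_slots must be positive")
    return math.ceil(num_files / warehouse_slots)


# One huge file leaves most of the warehouse idle (why option B fails),
# and parallel operations never exceed the file count (why option C fails).
print(effective_parallelism(num_files=1, warehouse_slots=8))
print(effective_parallelism(num_files=100, warehouse_slots=8))
print(load_waves(num_files=100, warehouse_slots=8))
```

In this model, a larger warehouse (a larger `warehouse_slots`) raises the cap on concurrent files, which is the point option D makes.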
Author: LeetQuiz Editorial Team
When loading data into Snowflake, what is a key consideration regarding the staging of files?
A. Create small data files and stage them in cloud storage frequently.
B. Create large data files to maximize the processing overhead for each file.
C. The number of load operations that run in parallel can exceed the number of data files to be loaded.
D. The number of data files that are processed in parallel is determined by the virtual warehouse.