
Answer-first summary for fast verification
Answer: Scenario 1
Scenario 1 will induce the least network traffic during a shuffle operation because it uses a single node with all 200 cores and 100GB RAM. Since all data processing occurs within a single node, shuffle operations do not require data transfer across the network between different nodes. In contrast, all other scenarios involve multiple nodes, which would require network traffic during shuffle operations as data needs to be exchanged between nodes. The community discussion confirms this reasoning with 100% consensus on answer C, noting that 'all the data would stay within the single node, avoiding any cross-node traffic.'
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Given the following cluster configurations with equivalent compute power (100GB RAM and 200 cores), which one will generate the least network traffic during a shuffle operation?
//IMG//

A
This cannot be determined without knowing the number of partitions.
B
Scenario 5
C
Scenario 1
D
Scenario 4
E
Scenario 6
No comments yet.