
Answer-first summary for fast verification
Answer: Scenario #1
The question asks which cluster configuration is most likely to experience garbage collection delays when processing a large DataFrame, given equal compute power (100GB RAM, 200 cores). Scenario #1 has the fewest executors (2) with the largest heap size per executor (50GB). Large heap sizes lead to longer garbage collection pauses because the JVM has to scan and clean more memory at once. The community discussion strongly supports this with 100% consensus on option D (Scenario #1), citing that larger heap spaces per executor result in longer GC times. Other scenarios with more executors and smaller heap sizes can handle garbage collection more efficiently through parallelism and shorter GC cycles.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Which of the following cluster configurations is most likely to experience delays due to garbage collection when processing a large DataFrame?
Note: Each configuration has roughly the same compute power, utilizing 100GB of RAM and 200 cores.

A
More information is needed to determine an answer.
B
Scenario #5
C
Scenario #4
D
Scenario #1
E
Scenario #2
No comments yet.