
Explanation:
Spilling occurs when Spark is forced to move data from memory to disk during a shuffle or sort. You can identify this in two main places:
The Stage's detail screen: in the Spark UI, the stage metrics report Spill (Memory) and Spill (Disk) values when a spill has occurred.
The Executor's log files: in the stdout or stderr logs, you can find explicit log entries from the UnsafeExternalSorter (e.g., Spilling data because...). These entries provide task-level confirmation that memory limits were exceeded.
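Both checks described above can be sketched in a few lines of Python. This is a minimal illustration, not an official tool: the helper names `find_spill_lines` and `stages_with_spill` are hypothetical, the sample log lines and stage records are made up for the example, and the stage records are assumed to be shaped like the JSON returned by the Spark monitoring REST API (with `memoryBytesSpilled` and `diskBytesSpilled` fields).

```python
import re

# Matches UnsafeExternalSorter log lines that mention spilling (case-insensitive).
SPILL_PATTERN = re.compile(r"UnsafeExternalSorter.*spill", re.IGNORECASE)


def find_spill_lines(log_lines):
    """Return the executor log lines in which UnsafeExternalSorter reports a spill."""
    return [line for line in log_lines if SPILL_PATTERN.search(line)]


def stages_with_spill(stages):
    """Return (stageId, memoryBytesSpilled, diskBytesSpilled) for stages that spilled.

    Assumption: `stages` is a list of dicts parsed from the stages endpoint
    of the Spark monitoring REST API.
    """
    return [
        (s["stageId"], s.get("memoryBytesSpilled", 0), s.get("diskBytesSpilled", 0))
        for s in stages
        if s.get("diskBytesSpilled", 0) > 0
    ]


# Illustrative inputs only; real log lines and metrics will differ.
sample_log = [
    "INFO Executor: Finished task 0.0 in stage 3.0 (TID 12)",
    "INFO UnsafeExternalSorter: Thread 68 spilling sort data of 4.0 GB to disk (0  time so far)",
]
sample_stages = [
    {"stageId": 2, "memoryBytesSpilled": 0, "diskBytesSpilled": 0},
    {"stageId": 3, "memoryBytesSpilled": 2048, "diskBytesSpilled": 1024},
]

print(find_spill_lines(sample_log))
print(stages_with_spill(sample_stages))
```

The log scan mirrors what you would look for by hand in an executor's stderr, while the stage filter mirrors the Spill (Memory) and Spill (Disk) columns on the Stage's detail screen.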
Data spilling to disk often occurs during the execution of wide transformations in Apache Spark when memory is insufficient. Which two locations within the Spark UI or logs serve as the primary indicators for identifying that partitions are spilling?
A
The Driver’s log files and the Executor’s log files.
B
The Stage’s detail screen and the SQL Query detail screen.
C
The Stage’s detail screen and the Executor’s log files.
D
The Executor’s detail screen and the Executor’s log files.