
You are analyzing the performance of a Spark application using the Spark UI and notice that the event timeline shows a significant gap between the completion of one stage and the start of the next. What could be the cause of this gap, and how would you address it?
A
The application is waiting for resources; increase the number of executors.
B
There is a data dependency issue; optimize the data flow between stages.
C
The application is experiencing a shuffle spill; optimize the shuffle operations.
D
The application is performing complex computations; increase the task parallelism.
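To reason about a gap like the one described in the question, it can help to quantify it first. The following is a minimal sketch, assuming the submission and completion timestamps of each stage have been copied by hand from the Spark UI event timeline. The `StageTiming` class, the `inter_stage_gaps` function, and the sample values are hypothetical and not part of any Spark API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class StageTiming:
    """Hypothetical record of one stage's timestamps, in milliseconds."""
    stage_id: int
    submitted_ms: int
    completed_ms: int


def inter_stage_gaps(stages: List[StageTiming]) -> List[Tuple[int, int, int]]:
    """Return (previous_stage_id, next_stage_id, gap_ms) for consecutive stages.

    Stages are ordered by submission time. Overlapping stages are
    reported with a gap of 0 rather than a negative value.
    """
    ordered = sorted(stages, key=lambda s: s.submitted_ms)
    gaps = []
    for prev, nxt in zip(ordered, ordered[1:]):
        gap_ms = max(0, nxt.submitted_ms - prev.completed_ms)
        gaps.append((prev.stage_id, nxt.stage_id, gap_ms))
    return gaps


# Illustrative (made-up) timings: stage 1 starts 3 seconds after stage 0 ends.
sample = [
    StageTiming(stage_id=0, submitted_ms=0, completed_ms=1000),
    StageTiming(stage_id=1, submitted_ms=4000, completed_ms=6000),
    StageTiming(stage_id=2, submitted_ms=6000, completed_ms=7000),
]
for prev_id, next_id, gap_ms in inter_stage_gaps(sample):
    print(f"stage {prev_id} -> stage {next_id}: {gap_ms} ms idle")
```

A large value here only shows where the idle time sits. It does not by itself say which of the options above explains it.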