
Ultimate access to all questions.
Consider a scenario where a Spark application is experiencing high latency during the execution of a batch job. Using the Spark UI, you notice that the event timeline shows a significant number of long-running tasks in a specific stage. What steps would you take to diagnose and potentially resolve this performance issue?
A
Increase the number of executors and reduce the task parallelism.
B
Check the data skew by analyzing the task metrics and repartition the data if necessary.
C
Decrease the memory allocated to the executors.
D
Ignore the long-running tasks as they might be due to external factors.