Databricks Certified Data Engineer - Professional

Ultimate access to all questions.

A Spark job is executing much slower than expected. Upon examining the Spark UI, a data engineer notices that within a specific stage, the minimum and median task durations are nearly identical, yet the maximum task duration is approximately 100 times longer than the minimum. What is the most likely cause for this performance bottleneck?

Real Exam

Last updated: January 6, 2026 at 15:39

Disk spillover caused by insufficient attached volume storage for temporary data.

0.0%

Data skew resulting from uneven distribution, where certain partitions contain significantly more records than others.

Loading comments...