Databricks Certified Data Engineer - Professional

Ultimate access to all questions.

A data engineer is analyzing a Spark job that is taking significantly longer than expected. Upon reviewing the Spark UI, they notice that the minimum and median completion times for tasks in a specific stage are nearly identical. However, the maximum task duration is approximately 100 times longer than the minimum.

What is the most likely cause of this performance discrepancy?

Real Exam

Last updated: January 6, 2026 at 15:41

Network latency resulting from cluster nodes being deployed in a different region than the source data storage.

0.0%

Credential validation delays occurring during the retrieval of data from an external system, leading to authentication retries.

Loading comments...