Databricks Certified Associate Developer for Apache Spark

Get started today

Ultimate access to all questions.

Explanation:

Stage boundaries in Apache Spark are induced by operations that require data shuffling. Shuffles occur during wide transformations where data is redistributed across partitions, necessitating a new stage. Among the options provided, only 'Shuffle' (A) directly causes a stage boundary. Caching (B) does not create a stage boundary; it materializes data but does not inherently involve shuffling. Executor failure (C), job delegation (D), and application failure (E) are related to runtime execution or fault tolerance but do not affect the logical division of stages. Therefore, the correct answer is A.

Explanation:

Comments (0)

No comments yet.

Which of the following operations triggers a stage boundary in Spark?

Exam-Like

Last updated: June 28, 2026 at 14:03

Shuffle

55.8%

Caching

9.5%

Executor failure

10.2%

Job delegation

15.0%

Application failure

9.5%