Databricks Certified Data Engineer - Associate


Explanation:

In this scenario, the job fails intermittently because of one specific task. A retry policy on that task lets Databricks re-attempt it a specified number of times before the job run is marked as failed. This confines the extra compute to the task that actually fails, rather than re-running the entire job or scheduling redundant runs. The retry policy is configured in the task settings under Advanced options by selecting Edit Retry Policy; the retry interval is calculated in milliseconds between the start of the failed run and the subsequent retry run.
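The same policy can also be expressed in a job definition rather than through the UI. The following is a minimal sketch, assuming the Jobs API 2.1 task fields `max_retries`, `min_retry_interval_millis`, and `retry_on_timeout`; the job name, task keys, notebook paths, and retry values are hypothetical, and cluster configuration is omitted for brevity. It only builds and prints the settings payload and does not call any Databricks endpoint.

```python
import json


def build_job_settings(max_retries=3, min_retry_interval_millis=60_000):
    """Build a job settings payload with a retry policy on one task only.

    All names and paths below are placeholders for illustration.
    """
    return {
        "name": "nightly_etl",  # hypothetical job name
        "schedule": {
            # Quartz cron expression: every day at 00:00:00
            "quartz_cron_expression": "0 0 0 * * ?",
            "timezone_id": "UTC",
        },
        "tasks": [
            {
                # Stable task: no retry fields, so it is not retried
                "task_key": "extract",
                "notebook_task": {"notebook_path": "/Jobs/extract"},
            },
            {
                # Intermittently failing task: retries are scoped to it
                "task_key": "flaky_transform",
                "depends_on": [{"task_key": "extract"}],
                "notebook_task": {"notebook_path": "/Jobs/transform"},
                "max_retries": max_retries,
                # Wait between the start of the failed run and the retry
                "min_retry_interval_millis": min_retry_interval_millis,
                "retry_on_timeout": False,
            },
        ],
    }


settings = build_job_settings()
print(json.dumps(settings, indent=2))
```

Because the retry fields sit on the individual task and not on the job, a failure in `flaky_transform` re-runs only that task, which is what keeps the compute cost down compared with retrying or rescheduling the whole job.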


A data engineering team has an ETL job that runs every midnight but fails intermittently due to a specific task, requiring manual reruns in the morning. This issue is causing significant overhead. What approach can the team take to ensure the job completes every night while minimizing compute costs?


Monitor the task during execution to identify the cause of failure

5.1%

Implement a retry policy specifically for the task that fails periodically

79.1%

Apply a retry policy to the entire job

9.9%

Schedule the job to run multiple times to guarantee at least one completion

2.4%

Use a Jobs cluster for each task within the job

3.6%