
Answer-first summary for fast verification
Answer: They can create a new task in the existing Job and then add it as a dependency of the original task.
## Explanation The correct answer is **B** because: 1. **Requirement**: The data engineer needs to run a new notebook **prior to** the original task. This means the new task should execute first, followed by the original task. 2. **Task Dependencies in Databricks Jobs**: In Databricks, you can set up task dependencies where one task must complete before another starts. When you add a task as a dependency of another task, the dependent task will wait for the upstream task to complete. 3. **Option B Analysis**: "They can create a new task in the existing Job and then add it as a dependency of the original task." This means: - Create a new task for the new notebook - Make this new task a dependency of the original task - Result: New task runs first → Original task runs after new task completes 4. **Why other options are incorrect**: - **A**: Cloning the existing task doesn't create a proper dependency chain - **C**: This would make the original task run first (opposite of what's needed) - **D**: Creating a new job from scratch is unnecessary and doesn't maintain the existing scheduling - **E**: Cloning to a new job creates separate jobs without proper coordination 5. **Best Practice**: Adding tasks with dependencies in the same job maintains the existing scheduling configuration and ensures proper execution order.
Author: Keng Suppaseth
Ultimate access to all questions.
A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task. Which of the following approaches can the data engineer use to set up the new task?
A
They can clone the existing task in the existing Job and update it to run the new notebook.
B
They can create a new task in the existing Job and then add it as a dependency of the original task.
C
They can create a new task in the existing Job and then add the original task as a dependency of the new task.
D
They can create a new job from scratch and add both tasks to run concurrently.
E
They can clone the existing task to a new Job and then edit it to run the new notebook.