
Ultimate access to all questions.
As a data engineer managing a crucial daily data processing pipeline that includes data ingestion, transformation, and loading into a data warehouse, you receive an alert about a job failure during the transformation task due to an unexpected cluster outage. What is the best course of action to minimize downtime and ensure data integrity?
A
Permanently remove the transformation task from the pipeline to avoid future failures.
B
Ignore the alert and wait for the next scheduled run.
C
Manually restart the entire job from the beginning.
D
Use the repair feature to rerun only the failed transformation task after resolving the cluster issue.