Google Professional Data Engineer

You manage a set of Spark jobs that run on a Cloud Dataproc cluster on a defined schedule. The jobs have dependencies: some must run in a specific sequence, while others can run concurrently. You need to automate the execution and orchestration of these jobs. What should you do?
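To make the scenario concrete, here is a minimal sketch of the ordering problem itself, independent of any answer option. The job names and their dependencies are hypothetical and not part of the question. It uses Python's standard `graphlib` module to group jobs into waves, where every job in a wave has all of its prerequisites satisfied and can therefore run concurrently with the others in that wave.

```python
from graphlib import TopologicalSorter

# Hypothetical jobs (not from the question); each maps to the set of
# jobs that must finish before it can start.
dependencies = {
    "ingest": set(),
    "clean": {"ingest"},
    "aggregate_sales": {"clean"},
    "aggregate_users": {"clean"},
    "report": {"aggregate_sales", "aggregate_users"},
}


def execution_waves(deps):
    """Group jobs into waves; jobs within one wave may run concurrently."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises CycleError if the dependencies are circular
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # jobs whose prerequisites are done
        waves.append(ready)
        sorter.done(*ready)  # mark the whole wave as finished
    return waves


waves = execution_waves(dependencies)
for number, wave in enumerate(waves, start=1):
    print(number, wave)
```

Whatever tool is chosen has to express this same structure: an ordering constraint between dependent jobs and freedom to parallelize the rest.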

Create a Cloud Dataproc Workflow Template

21.4%

Create an initialization action to execute the jobs

0.0%

Create a Bash script that uses the Cloud SDK to create a cluster, execute jobs, and then tear down the cluster