
Answer-first summary for fast verification
Answer (Option D): 1. Create an Apache Airflow directed acyclic graph (DAG) in Cloud Composer with sequential tasks using the Dataproc and BigQuery operators. 2. Create a separate DAG for each table that moves through the pipeline. 3. Use a Cloud Storage object trigger to launch a Cloud Function that triggers the DAG.
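The three steps above can be sketched as a per-table Airflow DAG. This is a minimal illustration, not the exam's reference solution: the DAG ID, project, region, bucket, and job definitions below are all placeholders, and the sketch assumes the `apache-airflow-providers-google` package is installed in the Composer environment.

```python
# Hypothetical per-table DAG sketch (all names/IDs are placeholders).
# Assumes apache-airflow plus the Google provider package.
import pendulum
from airflow import DAG
from airflow.providers.google.cloud.operators.dataproc import DataprocSubmitJobOperator
from airflow.providers.google.cloud.operators.bigquery import BigQueryInsertJobOperator

with DAG(
    dag_id="load_transform_orders",   # one DAG per table, e.g. "orders"
    schedule=None,                    # no schedule: triggered externally on file arrival
    start_date=pendulum.datetime(2024, 1, 1, tz="UTC"),
    catchup=False,
) as dag:
    # Step 1: initial transformation in Dataproc, writing into BigQuery.
    initial_transform = DataprocSubmitJobOperator(
        task_id="dataproc_initial_transform",
        project_id="my-project",      # placeholder
        region="us-central1",         # placeholder
        job={
            "placement": {"cluster_name": "etl-cluster"},
            "pyspark_job": {"main_python_file_uri": "gs://my-bucket/transform_orders.py"},
        },
    )

    # Step 2: table-specific follow-up transformation inside BigQuery.
    bq_transform = BigQueryInsertJobOperator(
        task_id="bq_table_transform",
        configuration={
            "query": {
                "query": "CALL my_dataset.transform_orders()",  # placeholder routine
                "useLegacySql": False,
            }
        },
    )

    initial_transform >> bq_transform  # sequential: Dataproc first, then BigQuery
```

Because `schedule=None`, this DAG only runs when something external (here, the Cloud Function) triggers it, which matches the "no fixed arrival time" requirement.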
Option D is correct. The transformations run in Dataproc and BigQuery, so Cloud Storage operators are unnecessary, which eliminates options A and B. Because new data arrives on no fixed schedule, the DAGs should be triggered by the arrival of new files rather than run on a timer. And since every table has its own transformation jobs, each table needs its own DAG. Option D satisfies all of these requirements: separate per-table DAGs, triggered on a Cloud Storage object event via a Cloud Function.
Author: LeetQuiz Editorial Team
Question: You need to orchestrate a series of sequential load and transformation jobs. Data files are added incrementally to a Cloud Storage bucket by an upstream process, with no predetermined arrival times. When new data arrives, a Dataproc job must run initial transformations and write the processed data to BigQuery. After that, additional transformation jobs, which differ for each table, must run within BigQuery; these jobs can take several hours to complete. Your task is to identify the most efficient and maintainable workflow that can process hundreds of tables while consistently delivering the freshest data to your end users. What strategy should you implement?
A
B
C
D