
Answer-first summary for fast verification
Answer: (A) Switch to a larger instance type for your Dataflow workers to provide more CPU and memory resources, and (C) increase the maximum number of workers in your pipeline to distribute the workload more evenly.
To address the bottleneck caused by sustained maximum CPU utilization, **A** scales up: a larger instance type gives each worker more CPU and memory for processing. **C** scales out: raising the maximum worker count lets autoscaling distribute the load across more machines. **B** and **E** propose temporary buffer tables in Bigtable and Cloud Spanner respectively, but an intermediate store adds operational complexity and does not relieve the CPU pressure in the Dataflow workers themselves. **D**'s suggestion to relocate to us-central1 is unlikely to help and could worsen performance, since the BigQuery dataset resides in the EU and cross-region writes add latency.
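Both winning strategies can be applied through standard Dataflow pipeline options at launch time. The sketch below assumes a Python Beam pipeline; the script name, project, region, and bucket are placeholders, and the specific machine type and worker cap are illustrative values you would tune to your workload.

```shell
# Hypothetical launch of a Python Beam pipeline on Dataflow.
# --machine_type applies strategy A (larger workers: more CPU/memory each);
# --max_num_workers applies strategy C (allow autoscaling to add workers).
python my_pipeline.py \
  --runner DataflowRunner \
  --project my-project \
  --region europe-west1 \
  --machine_type n1-standard-4 \
  --max_num_workers 10 \
  --autoscaling_algorithm THROUGHPUT_BASED \
  --temp_location gs://my-bucket/temp
```

With THROUGHPUT_BASED autoscaling enabled, Dataflow scales worker count between the minimum and `--max_num_workers` based on backlog, so raising the cap directly addresses the peak-period delays described in the question.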
Author: LeetQuiz Editorial Team
Your Dataflow pipeline, which processes data from a Pub/Sub topic and writes it to a BigQuery dataset in the EU, is facing delays during peak periods due to all three n1-standard-1 workers reaching maximum CPU utilization. What are two effective strategies to enhance your pipeline's performance?
A
Switch to a larger instance type for your Dataflow workers to provide more CPU and memory resources.
B
Consider creating a temporary buffer table in Bigtable before transferring data to BigQuery, though this may add complexity.
C
Increase the maximum number of workers in your pipeline to distribute the workload more evenly.
D
Relocate your Dataflow pipeline to the us-central1 zone, which might not solve the issue and could increase latency.
E
Opt for creating a temporary buffer table in Cloud Spanner as an intermediate step, acknowledging potential overhead.