
You are a data engineer responsible for managing ETL (Extract, Transform, Load) pipelines. A streaming pipeline running on Google Cloud Dataflow is processing incoming data slowly, which delays its output. On inspection, you find that Dataflow has automatically optimized the pipeline graph and fused several steps into a single step, so you cannot tell which of the original steps is the bottleneck. What should you do to diagnose the source of the delay?
A. Insert a Reshuffle operation after each processing step, and monitor the execution details in the Dataflow console.
B. Insert output sinks after each key processing step, and observe the writing throughput of each block.
C. Log debug information in each ParDo function, and analyze the logs at execution time.
D. Verify that the Dataflow service accounts have appropriate permissions to write the processed data to the output sinks.