
You are testing a Dataflow pipeline that reads gzip-compressed text files, routes records that fail processing to a dead-letter queue, and uses SideInputs to join data. The pipeline is taking longer than expected to complete. What can you do to speed up the Dataflow job?
A. Retry records that encounter errors.
B. Decrease the batch size.
C. Opt for compressed Avro files instead.
D. Replace SideInput with CoGroupByKey.