
Explanation:
In the Google Cloud Dataflow SDK, the ParDo operation is used to process each element in a PCollection. It is ideal for extracting specific fields, such as customer names, from each element. For more details, refer to the official documentation.
Ultimate access to all questions.
No comments yet.
You are tasked with analyzing customer data using Google's Dataflow SDK. The data includes entries like 'Tom,555 X street', 'Tim,553 Y street', and 'Sam, 111 Z street'. Your project requires extracting only the customer names from this data and writing them to an output PCollection. Which operation is most suitable for this task?
A
Sink API
B
Data extraction
C
ParDo
D
Source API