
Answer-first summary for fast verification
Answer: Create a write stream in the committed type.
The BigQuery Storage Write API offers different stream types to handle data ingestion. The **committed type** stream ensures that once data is written, it is immediately committed and visible in BigQuery. This stream type automatically handles deduplication by leveraging offset management, preventing duplicates without requiring additional logic. In contrast, the default stream (buffered type) may require manual deduplication checks. Options C and D introduce external systems (Kafka, Pub/Sub) and extra processing steps (UUIDs, Cloud Functions), which add complexity and are not the simplest solution. Therefore, using the committed stream type (B) directly addresses the requirement with minimal operational effort.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are developing an application component to capture user behavior data and stream it to BigQuery using the BigQuery Storage Write API. You need to prevent duplicate data in BigQuery while using the simplest operational approach. What should you do?
A
Create a write stream in the default type.
B
Create a write stream in the committed type.
C
Configure a Kafka cluster. Use a primary universally unique identifier (UUID) for duplicate messages.
D
Configure a Pub/Sub topic. Use Cloud Functions to subscribe to the topic and remove any duplicates.
No comments yet.