
Answer-first summary for fast verification
Answer: Use Datastream to capture changes, and use Dataflow to write those changes to BigQuery.
To achieve near-real time replication of tables from Cloud SQL for MySQL to BigQuery using Google-managed services, the best approach is to use Datastream in combination with Dataflow. Datastream is a serverless and easy-to-use change data capture (CDC) and replication service that allows you to synchronize data across heterogeneous databases, storage systems, and applications reliably and with minimal latency. Dataflow, on the other hand, is a fully managed service for stream and batch processing that can be used to transform and write the captured changes into BigQuery. This combination ensures fast, reliable, and Google-managed data replication. Option A suggests developing a custom solution, which would not leverage Google-managed services and would require significant maintenance. Option B, using Cloud SQL federated queries, is not suitable for near-real time replication as it is designed for querying data across different databases rather than replicating data. Option C, using Database Migration Service, is more suited for migrating databases rather than replicating specific tables in near-real time.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
How can you set up fast and reliable near-real-time replication of specific tables from a privately accessed Cloud SQL for MySQL instance to BigQuery using Google-managed services for analytics and machine learning purposes?
A
Develop a custom data replication service to send data into BigQuery.
B
Use Cloud SQL federated queries.
C
Use Database Migration Service to replicate tables into BigQuery.
D
Use Datastream to capture changes, and use Dataflow to write those changes to BigQuery.
No comments yet.