
Explanation:
The correct answer is C.
gsutil for the migration: gsutil is a command-line tool that allows you to manage Cloud Storage resources. It is a reliable and efficient tool for transferring large files to Google Cloud Storage. Given the files are large (90 GB each), gsutil is a suitable choice for this task.
Pub/Sub and Dataflow for the real-time updates:
Why the other options are incorrect:
A: BigQuery Data Transfer Service is not suitable for migrating data from an on-premises data center to Google Cloud Storage. Dataproc, while capable of real-time data processing, is not as well-suited for streaming data processing as Dataflow.
B: While gsutil is appropriate for the initial data migration, it is not designed for real-time updates. Pub/Sub and Dataflow are better suited for this purpose.
D: Storage Transfer Service is primarily for transferring data from on-premises storage to Cloud Storage and is unnecessary in this scenario. Cloud Data Fusion can be used for real-time data processing, but Dataflow is more appropriate for continuously updating a data warehouse in real time.
Ultimate access to all questions.
No comments yet.
You are in the process of migrating your data warehouse to Google Cloud and shutting down your on-premises data center. You have allocated bandwidth for the initial data load to the cloud, where each file is 90 GB in size, and you need your transactional systems to update the warehouse on Google Cloud in real time. Which tools should you use for both the data migration and ensuring real-time updates to your warehouse on Google Cloud?
A
BigQuery Data Transfer Service for the migration; Pub/Sub and Dataproc for the real-time updates
B
gsutil for both the migration and the real-time updates
C
gsutil for the migration; Pub/Sub and Dataflow for the real-time updates
D
Storage Transfer Service for the migration; Pub/Sub and Cloud Data Fusion for the real-time updates