
Answer-first summary for fast verification
Answer: Store the full dataset in BigQuery, and store a compressed copy of the data in a Cloud Storage bucket.
The best choice is C: 'Store the full dataset in BigQuery, and store a compressed copy of the data in a Cloud Storage bucket.' This solution allows you to perform data warehouse-style analytics using BigQuery, while also exposing the dataset as files for batch analysis tools in other cloud providers. Storing the data in BigQuery ensures high performance for analytics tasks, while keeping a compressed copy in Cloud Storage provides a cost-effective way of making the data accessible to other cloud providers. Options A and B do not address the requirement of making the dataset available for batch analysis tools in other cloud providers. Option D only partially meets the requirements, as it makes it difficult to query or analyze the archived data efficiently within BigQuery.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are tasked with designing an efficient storage and processing platform for a petabyte of analytics data. The platform must be capable of supporting data warehouse-style analytics within Google Cloud. Additionally, it should allow you to expose the dataset as files so that batch analysis tools in other cloud providers can also access and process the data. What approach should you take to fulfill these requirements?
A
Store and process the entire dataset in BigQuery.
B
Store and process the entire dataset in Bigtable.
C
Store the full dataset in BigQuery, and store a compressed copy of the data in a Cloud Storage bucket.
D
Store the warm data as files in Cloud Storage, and store the active data in BigQuery. Keep this ratio as 80% warm and 20% active.
No comments yet.