
Answer-first summary for fast verification
Answer: BigQuery: A fully-managed, serverless data warehouse service designed for storing and querying massive datasets quickly using SQL. It supports real-time data analysis, can handle petabytes of data, and scales automatically, making it ideal for big data analytics in a cost-effective manner.
BigQuery is the correct choice because it is specifically optimized for SQL-based queries on large datasets, offering real-time data analysis capabilities, the ability to handle petabytes of data efficiently, and automatic scaling. Its serverless nature ensures cost-effectiveness, making it the best fit for the company's requirements. Other options like Bigtable and Dataproc do not support SQL queries directly, and Cloud Spanner, while supporting SQL, is not optimized for large-scale analytics and may incur higher costs.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are tasked with designing a data analytics solution for a global retail company that generates petabytes of transaction data daily. The solution must support complex SQL queries for real-time analytics, be cost-effective, and scale seamlessly with the company's growth. Which Google Cloud service is the BEST choice for storing and querying this large dataset efficiently, and why? Choose the most appropriate option.
A
Bigtable: A NoSQL database service optimized for large-scale, low-latency workloads. While it handles massive amounts of data, it does not support SQL queries directly, making it unsuitable for the requirement of complex SQL queries.
B
Dataproc: A fully-managed service for running Apache Spark and Apache Hadoop clusters. It is designed for data processing and analytics but lacks the native SQL query support needed for real-time analytics on large datasets.
C
BigQuery: A fully-managed, serverless data warehouse service designed for storing and querying massive datasets quickly using SQL. It supports real-time data analysis, can handle petabytes of data, and scales automatically, making it ideal for big data analytics in a cost-effective manner.
D
Cloud Spanner: A globally distributed, horizontally scalable database service that supports SQL. While it offers high availability and strong consistency, it is optimized for transactional workloads rather than large-scale analytics, potentially leading to higher costs for analytics use cases.
No comments yet.