
Answer-first summary for fast verification
Answer: Develop an application that sends events to Cloud Pub/Sub, and implement a Cloud Dataflow pipeline to transform JSON event payloads into Avro, saving the data to Cloud Storage and BigQuery.
**Correct Answer: D**

This option is correct because:

1. **Decoupling producer from consumer**: Cloud Pub/Sub separates event producers (your application) from consumers (the Dataflow pipeline), keeping the system scalable and flexible.
2. **Efficient storage**: Avro files on Cloud Storage are a compact, cost-effective way to retain the raw data indefinitely, meeting the space and cost requirement.
3. **Near real-time SQL queries**: Transforming the JSON payloads with Cloud Dataflow and streaming the results into BigQuery enables near real-time SQL queries.
4. **Historical data retention**: BigQuery makes it straightforward to keep and query at least 2 years of historical data.

**Why the other options are incorrect**:

- **A**: Polling an API and saving gzipped JSON files to Cloud Storage provides neither real-time processing nor near real-time SQL querying, and it does not address 2 years of queryable history.
- **B**: Cloud SQL with periodic exports scales poorly toward 150 GB of new data per day and cannot deliver near real-time processing.
- **C**: Spark on Cloud Dataproc writing Avro to HDFS on Persistent Disk is feasible but adds operational complexity, and Persistent Disk is a costlier long-term store than Cloud Storage; Cloud Dataflow with native BigQuery integration is the simpler fit.
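As a sketch of the transform step in option D: the Dataflow pipeline parses each JSON event payload from Pub/Sub and emits a record conforming to an Avro schema. The schema and field names below are illustrative assumptions (not given in the question), and the Beam and Avro libraries are stubbed out with plain Python so only the core mapping logic is shown.

```python
import json

# Illustrative Avro schema for an event record; field names are assumptions.
EVENT_SCHEMA = {
    "type": "record",
    "name": "Event",
    "fields": [
        {"name": "event_id", "type": "string"},
        {"name": "timestamp", "type": "string"},
        {"name": "payload", "type": "string"},
    ],
}

def json_to_avro_record(message: bytes) -> dict:
    """Parse a Pub/Sub message body (JSON) into a dict matching EVENT_SCHEMA.

    In a real Cloud Dataflow pipeline this logic would run inside a Beam DoFn;
    the resulting records would then be written as Avro files to Cloud Storage
    and streamed into BigQuery via the pipeline's BigQuery sink.
    """
    event = json.loads(message.decode("utf-8"))
    field_names = [f["name"] for f in EVENT_SCHEMA["fields"]]
    # Keep only schema fields; coerce values to the declared string type.
    return {name: str(event.get(name, "")) for name in field_names}

# Example: one event as it might arrive from Pub/Sub.
msg = b'{"event_id": "e-1", "timestamp": "2024-01-01T00:00:00Z", "payload": "click"}'
record = json_to_avro_record(msg)
```

Keeping the transform a pure function of one message makes it easy to unit-test outside the pipeline, which is a common Beam practice.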
Author: LeetQuiz Editorial Team
You are developing a new application that generates approximately 150 GB of JSON data daily by year's end. Your goals include decoupling producers from consumers, storing raw data efficiently in terms of space and cost, enabling near real-time SQL queries, and maintaining at least 2 years of historical data for SQL querying. Which pipeline best meets these requirements?
**A.** Develop an application with an API. Create a tool to poll the API and save data to Cloud Storage as gzipped JSON files.

**B.** Develop an application that writes data to a Cloud SQL database. Set up periodic exports from the database to Cloud Storage and then load into BigQuery.

**C.** Develop an application that sends events to Cloud Pub/Sub, and use Spark jobs on Cloud Dataproc to convert JSON data to Avro format, storing it on HDFS on Persistent Disk.

**D.** Develop an application that sends events to Cloud Pub/Sub, and implement a Cloud Dataflow pipeline to transform JSON event payloads into Avro, saving the data to Cloud Storage and BigQuery.
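A quick back-of-the-envelope check on the retention requirement helps explain why storage cost matters here (150 GB/day from the question; the 365-day year is an assumption):

```python
# Raw JSON volume over the required 2-year retention window.
daily_gb = 150
days = 2 * 365
total_gb = daily_gb * days          # 109,500 GB of raw JSON
total_tb = total_gb / 1024          # just under 107 TB
```

At roughly 107 TB of raw JSON over two years, a columnar binary format like Avro plus Cloud Storage pricing makes a meaningful cost difference versus Cloud SQL or Persistent Disk.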