
Answer-first summary for fast verification
Answer: (A) Develop a data pipeline where status updates are appended to BigQuery instead of updated, and (D) Denormalize the data as much as possible.
The optimal strategies for this scenario are:

- **Appending status updates to BigQuery (A):** Appending is far more efficient than in-place updates for data that changes hourly, preserves a full historical record of status changes, and matches the dataset's append-heavy growth pattern (3 TB per day).
- **Denormalizing the data (D):** Denormalization trades redundant storage for fewer joins, which improves query performance and simplifies feature extraction for machine learning models.

The other options are less suitable:

- **Using BigQuery UPDATE (C):** DML updates are resource-intensive and inefficient when thousands of transactions change status every hour.
- **Preserving the data structure (E):** Keeping a normalized structure forces joins at query time and does not optimize query performance as effectively as denormalization.
- **Storing snapshots in Cloud Storage (B):** Daily Avro snapshots queried as external tables add operational complexity and cannot keep up with hourly status updates.
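The append-only pattern in option A works because the "current" state can always be recovered at query time by keeping the latest row per transaction. A minimal sketch of that dedup-on-read logic in plain Python (the table name, column names, and sample rows are illustrative assumptions, not part of the question):

```python
from datetime import datetime

# Append-only event log: every status change is a NEW row, never an UPDATE.
# In BigQuery this would be a streaming or batch insert (strategy A).
events = [
    {"txn_id": "t1", "status": "PENDING",   "ts": datetime(2024, 1, 1, 9)},
    {"txn_id": "t1", "status": "SHIPPED",   "ts": datetime(2024, 1, 1, 12)},
    {"txn_id": "t2", "status": "PENDING",   "ts": datetime(2024, 1, 1, 10)},
    {"txn_id": "t1", "status": "DELIVERED", "ts": datetime(2024, 1, 2, 8)},
]

def latest_status(rows):
    """Keep only the most recent row per transaction (dedup-on-read)."""
    latest = {}
    for r in rows:
        current = latest.get(r["txn_id"])
        if current is None or r["ts"] > current["ts"]:
            latest[r["txn_id"]] = r
    return latest

# The same dedup expressed as BigQuery SQL, using a window function.
# The fully-qualified table name is a placeholder.
DEDUP_SQL = """
SELECT * EXCEPT(rn)
FROM (
  SELECT *,
         ROW_NUMBER() OVER (PARTITION BY txn_id ORDER BY ts DESC) AS rn
  FROM `project.dataset.transactions`
)
WHERE rn = 1
"""

current = {k: v["status"] for k, v in latest_status(events).items()}
print(current)  # latest status per transaction
```

This keeps every historical status (useful as ML training signal) while still giving analysts a cheap view of the current state, without any DML updates.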
Author: LeetQuiz Editorial Team
To support the data science team's analysis, a data pipeline is required to transfer time-series transaction data into BigQuery. The dataset, initially 1.5 PB in size and growing by 3 TB daily, consists of thousands of transactions updated hourly with new statuses. This structured data will be utilized for building machine learning models. Which two strategies would best optimize performance and usability for the data science team? (Select two.)
A. Develop a data pipeline where status updates are appended to BigQuery instead of updated.
B. Copy a daily snapshot of transaction data to Cloud Storage and store it as an Avro file. Use BigQuery's support for external data sources to query.
C. Use BigQuery UPDATE to further reduce the size of the dataset.
D. Denormalize the data as much as possible.
E. Preserve the structure of the data as much as possible.