Google Professional Machine Learning Engineer

Ultimate access to all questions.

You are working on a large-scale machine learning project where you are using TensorFlow to train a model on a structured dataset containing 100 billion records. These records are currently stored in multiple CSV files. To optimize the input/output execution performance and ensure efficient data processing and training, what should you do?

Exam-Like

Load the data into BigQuery, and read the data from BigQuery.

15.6%

Load the data into Cloud Bigtable, and read the data from Bigtable.

10.2%

Loading comments...

Convert the CSV files into shards of TFRecords, and store the data in the Hadoop Distributed File System (HDFS).

9.0%