LeetQuiz Logo
Privacy Policy•contact@leetquiz.com
© 2025 LeetQuiz All rights reserved.
Google Professional Machine Learning Engineer

Google Professional Machine Learning Engineer

Get started today

Ultimate access to all questions.


You are working on a machine learning project where you need to train a TensorFlow model on a structured dataset containing 100 billion records, stored across multiple CSV files. The project is constrained by tight deadlines and a limited budget, requiring an efficient solution that minimizes costs while maximizing performance. Additionally, the solution must be scalable to accommodate future data growth. Given these constraints, which of the following approaches would BEST improve the input/output execution performance for training your TensorFlow model? Choose the two most effective options.

Real Exam




Explanation:

The most effective method to enhance I/O performance during training is to convert the CSV files into shards of TFRecords and store them in Cloud Storage. TFRecord, a binary format, is optimized for TensorFlow, ensuring efficient storage and retrieval. This approach benefits from efficient data format, parallel processing, scalability, and TensorFlow integration. Implementing a combination of TFRecords in Cloud Storage and BigQuery (Option E) can also be effective for scenarios requiring complex queries on the dataset, though it may introduce additional costs. Other options like Cloud Bigtable (Option A) are tailored for NoSQL data, not structured CSV files, and HDFS (Option B), while viable, doesn't offer the same level of integration and management as Cloud Storage for TensorFlow on Google Cloud. BigQuery (Option C) lacks the specialized optimization for TensorFlow workflows.

Powered ByGPT-5