
Answer-first summary for fast verification
Answer:
1. Create a Dataflow job that writes sharded TFRecord files to a Cloud Storage directory.
2. Reference tf.data.TFRecordDataset in the training script.
3. Train the model by using Vertex AI Training with a V100 GPU.
Option A is correct because it uses a Dataflow job to write sharded TFRecord files to a Cloud Storage directory. Dataflow is a fully managed service, so the preprocessing step scales to millions of labeled images with minimal maintenance, and sharding the output lets multiple readers consume the data in parallel. Referencing tf.data.TFRecordDataset in the training script takes advantage of TensorFlow's optimized input pipeline (parallel reads, prefetching, and efficient deserialization of the TFRecord format). Finally, running the job on Vertex AI Training with a V100 GPU provides managed, high-performance hardware for the training step itself. Together these choices yield the low-maintenance, efficient, and scalable pipeline the requirements call for.
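As a minimal sketch of steps 1 and 2, the snippet below writes a few sharded TFRecord files to a local directory (standing in for the Cloud Storage path a Dataflow job would write to; all file names, feature keys, and the synthetic "image" bytes are illustrative assumptions, not part of the question) and then builds the tf.data.TFRecordDataset input pipeline a training script would use:

```python
import os
import tempfile

import tensorflow as tf

# Local stand-in for the Cloud Storage output directory
# (e.g. gs://my-bucket/tfrecords/); path and names are hypothetical.
out_dir = tempfile.mkdtemp()


def make_example(label: int) -> tf.train.Example:
    # Serialize one labeled "image" (random bytes as a stand-in for JPEG data).
    feature = {
        "image_raw": tf.train.Feature(
            bytes_list=tf.train.BytesList(value=[os.urandom(16)])),
        "label": tf.train.Feature(
            int64_list=tf.train.Int64List(value=[label])),
    }
    return tf.train.Example(features=tf.train.Features(feature=feature))


# Write two shards, the way a Dataflow job would at much larger scale.
for shard in range(2):
    path = os.path.join(out_dir, f"images-{shard:05d}-of-00002.tfrecord")
    with tf.io.TFRecordWriter(path) as writer:
        for label in range(5):
            writer.write(make_example(label).SerializeToString())

# Training-side input pipeline: glob the shards, read and parse in parallel,
# batch, and prefetch so the GPU is never starved for data.
feature_spec = {
    "image_raw": tf.io.FixedLenFeature([], tf.string),
    "label": tf.io.FixedLenFeature([], tf.int64),
}

files = tf.data.Dataset.list_files(os.path.join(out_dir, "images-*"))
dataset = (
    tf.data.TFRecordDataset(files, num_parallel_reads=tf.data.AUTOTUNE)
    .map(lambda rec: tf.io.parse_single_example(rec, feature_spec),
         num_parallel_calls=tf.data.AUTOTUNE)
    .batch(4)
    .prefetch(tf.data.AUTOTUNE)
)

# Count records to confirm both shards are consumed (2 shards x 5 records).
total = sum(int(batch["label"].shape[0]) for batch in dataset)
print(total)
```

In a real pipeline the shards would be produced by Apache Beam running on Dataflow, the glob pattern would point at the `gs://` directory, and the same tf.data pipeline would feed the model inside the Vertex AI Training job.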
Author: LeetQuiz Editorial Team
You are tasked with training an image classification model using TensorFlow. The dataset at your disposal is stored in a Cloud Storage directory and consists of millions of labeled images. Prior to commencing model training, data preparation is crucial. Your goal is to ensure that the entire data preprocessing and model training workflow is efficient, scalable, and requires minimal maintenance. Given these requirements, which approach should you take?