Google Professional Machine Learning Engineer

Get started today

Ultimate access to all questions.

Explanation:

The correct answer is D: Add parallel interleave to the pipeline. When data is split into multiple files and you want to improve the execution time of your input pipeline, using parallel interleave can help. Parallel interleave allows for parallel reading and processing of input data, which reduces bottlenecks caused by synchronous data loading. This improves the overall throughput and efficiency of your model training process, especially when dealing with large datasets split across multiple files.

Explanation:

Comments (0)

No comments yet.

During the training of your machine learning model, you notice that GPU utilization is lower than expected due to a synchronous data loading process. The training dataset is divided into multiple files for input. To improve the model training efficiency and reduce the execution time of your input pipeline, what should you do?

Exam-Like

Increase the CPU load

3.1%

Add caching to the pipeline

9.3%

Increase the network bandwidth

3.6%

Add parallel interleave to the pipeline

83.9%