While training your machine learning model, you notice that GPU utilization is lower than expected because the input data is loaded synchronously. The training dataset is split across multiple input files. To improve training efficiency and reduce the execution time of your input pipeline, what should you do?
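To make the scenario concrete, here is a minimal sketch of one common way to remove a synchronous loading bottleneck: read several files in parallel and keep a small buffer of loaded data ready ahead of the consumer. It is an illustration under assumptions, not the expected answer's exact wording. The names `read_file` and `prefetching_loader`, the parameters `num_readers` and `buffer_size`, and the line-per-record text format are all hypothetical and do not come from the question.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor


def read_file(path):
    """Read one input file and return its non-empty lines as records.

    Assumption: each file is plain text with one record per line.
    """
    with open(path, "r", encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f if line.strip()]


def prefetching_loader(paths, num_readers=4, buffer_size=2):
    """Yield one list of records per file, in the order of `paths`.

    Files are read by `num_readers` worker threads, and up to
    `buffer_size` loaded files wait in a queue, so the consumer
    (for example a training step) rarely has to wait on I/O.
    """
    buffer = queue.Queue(maxsize=buffer_size)
    sentinel = object()  # marks the end of the stream

    def producer():
        try:
            with ThreadPoolExecutor(max_workers=num_readers) as pool:
                # pool.map reads files concurrently but returns results
                # in input order.
                for records in pool.map(read_file, paths):
                    buffer.put(records)  # blocks while the buffer is full
            buffer.put(sentinel)
        except Exception as exc:
            # Hand read errors to the consumer instead of hanging it.
            buffer.put(exc)

    threading.Thread(target=producer, daemon=True).start()

    while True:
        item = buffer.get()
        if item is sentinel:
            return
        if isinstance(item, Exception):
            raise item
        yield item
```

A training loop would consume it as `for records in prefetching_loader(shard_paths): ...`, so that reading the next files overlaps with computation on the current one. If the framework happens to be TensorFlow (an assumption; the question does not say), the built-in counterparts are `tf.data.Dataset.interleave` with `num_parallel_calls` and `Dataset.prefetch`.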