LeetQuiz Logo
Privacy Policy•contact@leetquiz.com
© 2025 LeetQuiz All rights reserved.
Google Professional Machine Learning Engineer

Google Professional Machine Learning Engineer

Get started today

Ultimate access to all questions.


You are an ML engineer at a mobile gaming company. A data scientist on your team recently trained a TensorFlow model for a mobile game application. Your task is to deploy this model into the mobile app to optimize its performance. Despite the model's good accuracy, you find that the inference latency does not meet the stringent production requirements of the mobile application. To ensure a smooth user experience, you need to reduce the inference time by 50%. Accepting a slight decrease in model accuracy is acceptable to meet the latency requirement. Without retraining a new model, which model optimization technique should you try first to reduce latency?

Exam-Like



Powered ByGPT-5