
Google Professional Machine Learning Engineer
You have trained a Deep Neural Network (DNN) regressor using TensorFlow to predict housing prices from features such as location, size, and amenities. The model was built with tf.float64 precision using TensorFlow's Estimator API, and it showed good predictive performance during evaluation. Before deploying to production, however, you observed a serving latency of 10 milliseconds at the 90th percentile on CPUs, while the production environment requires a latency of no more than 8 milliseconds at the 90th percentile. You are prepared to accept a slight reduction in predictive accuracy to meet this latency requirement. You need to quickly reduce the serving latency and then evaluate the impact on model accuracy. What should you try first to achieve the desired latency?
Explanation:
The correct answer is B: Apply quantization to your SavedModel by reducing the floating point precision to tf.float16. Quantization is a model optimization technique that lowers the numerical precision of the model's weights, which shrinks the model size and reduces the compute and memory bandwidth needed at inference time; this typically yields faster loading and execution, and therefore lower latency. Unlike switching from CPU to GPU, which introduces additional hardware requirements and potentially more complex deployment changes, quantization can be applied directly within the TensorFlow framework relatively quickly. The accuracy loss from reducing precision to tf.float16 is generally minimal, making this the most suitable first step toward meeting the latency requirement.
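For reference, here is a minimal sketch of one common way to apply post-training float16 quantization to an exported SavedModel, using the TensorFlow Lite converter. The export directory, output file name, and input shape below are hypothetical placeholders, not values from the question:

```python
import numpy as np
import tensorflow as tf

# Convert the exported SavedModel with post-training float16 quantization.
# "exported_model/1681234567" is a hypothetical Estimator export directory.
converter = tf.lite.TFLiteConverter.from_saved_model("exported_model/1681234567")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]
tflite_model = converter.convert()

with open("housing_regressor_fp16.tflite", "wb") as f:
    f.write(tflite_model)

# Sanity-check the quantized model with the TFLite interpreter
# before re-running the full evaluation set.
interpreter = tf.lite.Interpreter(model_path="housing_regressor_fp16.tflite")
interpreter.allocate_tensors()
input_details = interpreter.get_input_details()[0]
output_details = interpreter.get_output_details()[0]

# Feed one random sample shaped like the model's input (placeholder data).
sample = np.random.rand(*input_details["shape"]).astype(input_details["dtype"])
interpreter.set_tensor(input_details["index"], sample)
interpreter.invoke()
prediction = interpreter.get_tensor(output_details["index"])
```

After conversion, you would re-run the evaluation set through the quantized model and compare its regression metric (for example, RMSE) against the original float64 model, confirming both the latency gain and the acceptability of any accuracy loss before promoting it to production.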