Google Professional Machine Learning Engineer

Get started today

Ultimate access to all questions.

Explanation:

The correct answer is D. Running training-serving skew detection batch jobs every few days to compare the aggregate statistics of the features in the training dataset with recent serving data is an effective method to monitor when the model's performance has degraded. This approach will allow you to efficiently detect any discrepancies or shifts in the data distribution that can lead to performance degradation. When skew is detected, sending the most recent serving data to the labeling service can help in timely evaluation and retraining of the model, ensuring consistent performance while minimizing the costs associated with frequent labeling.

Explanation:

Comments (0)

No comments yet.

You deployed an ML model into production a year ago to serve predictions for a business-critical application. To ensure the model remains accurate, you have been collecting all raw requests sent to your model prediction service each month and sending a subset of these requests to a human labeling service to evaluate your model’s performance. Over the past year, you have observed varying patterns in model degradation—sometimes the performance drops significantly within a month, while other times it takes several months to notice such decreases. Given that the human labeling service is expensive, you need a strategy to balance the frequency of retraining the model to maintain high performance without incurring unnecessary costs. What should you do?

Exam-Like

Train an anomaly detection model on the training dataset, and run all incoming requests through this model. If an anomaly is detected, send the most recent serving data to the labeling service.

13.1%

Identify temporal patterns in your model’s performance over the previous year. Based on these patterns, create a schedule for sending serving data to the labeling service for the next year.

11.0%

Compare the cost of the labeling service with the lost revenue due to model performance degradation over the past year. If the lost revenue is greater than the cost of the labeling service, increase the frequency of model retraining; otherwise, decrease the model retraining frequency.

10.3%

Run training-serving skew detection batch jobs every few days to compare the aggregate statistics of the features in the training dataset with recent serving data. If skew is detected, send the most recent serving data to the labeling service.