
Ultimate access to all questions.
A company is developing a new model to predict the prices of specific items. The model performed well on the training dataset. When the company deployed the model to production, the model's performance decreased significantly. What should the company do to mitigate this problem?
Explanation:
This question describes a classic machine learning problem known as overfitting. Here's what's happening:
Overfitting Scenario: The model performed well on the training dataset but poorly in production, which indicates the model has learned the training data too well, including noise and random fluctuations, rather than the underlying patterns.
Why Option C is Correct: Increasing the volume of training data helps the model learn more general patterns rather than memorizing specific examples. With more diverse data, the model is less likely to overfit and more likely to generalize well to new, unseen data in production.
Why Other Options are Incorrect:
Additional Solutions: Other approaches to mitigate overfitting include:
Key Takeaway: When a model performs well on training data but poorly in production, it's typically an overfitting issue, and increasing training data volume is one of the most effective ways to improve generalization.