Google Professional Machine Learning Engineer

Get started today

Ultimate access to all questions.

Explanation:

To address the issue of biased model performance due to imbalanced training data without collecting additional data, two effective strategies are: implementing cost-sensitive learning to penalize the model more for errors on minority groups, which encourages the model to improve its performance on these groups; and using SMOTE to synthetically increase the representation of minority groups in the training data, thereby enhancing the model's ability to learn from these groups. These approaches directly tackle the imbalance issue while adhering to compliance constraints.

Explanation:

Comments (0)

No comments yet.

You are a Machine Learning Engineer at a tech company that has recently deployed a model to predict loan approvals. After three months of deployment, an audit reveals that the model's performance is significantly worse for applicants from certain demographic subgroups, raising concerns about biased outcomes. The investigation suggests that the training data was imbalanced, with underrepresented groups not adequately represented. Due to privacy regulations, collecting additional data is not an option. The company is now looking for strategies to mitigate this bias without violating compliance constraints. Which two strategies would you recommend to best address this issue? (Choose two.)

Real Exam

Document the model's limitations and provide a detailed explanation of its behavior to the stakeholders, without making any changes to the model.

1.7%

Remove data points from overrepresented groups to balance the dataset and retrain the model, despite the reduction in overall dataset size.

5.2%

Implement a cost-sensitive learning approach by adjusting the loss function to impose a higher penalty for misclassifications in the minority groups, and retrain the model.

34.5%

Apply synthetic minority over-sampling technique (SMOTE) to the existing dataset to increase the representation of minority groups, and retrain the model.