
**Answer:** Ensure that the data is balanced and collected from a diverse group.
## Detailed Explanation

To build a responsible machine learning model that minimizes bias in product recommendations, the company should **ensure that the data is balanced and collected from a diverse group (Option C)**. This approach directly addresses the core challenge of bias reduction in ML models.

### Why Option C is Optimal

1. **Representative Data Collection**: By collecting data from a diverse group of customers across various demographics (age, gender, location, income level, etc.), the model learns patterns that are inclusive rather than skewed toward any particular segment. This prevents the model from overfitting to dominant groups and underperforming for underrepresented ones.
2. **Balanced Dataset**: A balanced dataset ensures that no single demographic or behavioral group disproportionately influences the model's predictions. This is crucial for fairness, as imbalanced data can lead to biased recommendations that favor majority groups while neglecting minority preferences.
3. **Mitigation of Historical Biases**: Retail data often reflects historical purchasing patterns that may contain societal or systemic biases. Actively seeking diversity in data collection helps counteract these embedded biases, leading to more equitable recommendations.
4. **Alignment with AWS Responsible AI Principles**: AWS emphasizes fairness and bias mitigation as key components of responsible AI. Collecting diverse, balanced data aligns with AWS best practices for building inclusive ML models.

### Why the Other Options Are Less Suitable

- **Option A (Use data from only customers who match the demographics of the company's overall customer base)**: This approach perpetuates existing biases. If the current customer base is not diverse, it reinforces historical inequalities rather than reducing bias.
- **Option B (Collect data from customers who have a past purchase history)**: While purchase history is valuable for recommendation systems, relying solely on it excludes new or potential customers. This creates a "rich get richer" bias in which existing customers receive better recommendations while new ones are disadvantaged.
- **Option D (Ensure that the data is from a publicly available dataset)**: Public datasets may not be representative of the company's specific customer base and often lack the diversity needed for fair recommendations. They may also carry their own biases that conflict with the company's responsible AI goals.

### Best Practice Implementation

The company should implement systematic data collection strategies that actively seek representation from all customer segments, including those that have historically been underrepresented. This involves:

- Stratified sampling across demographic dimensions
- Regular audits of data diversity
- Inclusion of edge cases and minority preferences
- Continuous monitoring for bias during model development and deployment

By prioritizing diverse and balanced data collection, the company builds a foundation for fair, inclusive product recommendations that serve all customers equitably.
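The stratified sampling and diversity-audit practices described above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed AWS implementation: the record fields (`id`, `age_group`), the group sizes, and the function names are all assumptions made for the example.

```python
import random
from collections import Counter, defaultdict

# Hypothetical raw customer data, deliberately skewed toward one age group
# (60% / 30% / 10%) to mimic an imbalanced retail dataset.
customers = (
    [{"id": i, "age_group": "18-25"} for i in range(600)]
    + [{"id": i, "age_group": "26-40"} for i in range(600, 900)]
    + [{"id": i, "age_group": "41+"} for i in range(900, 1000)]
)

def stratified_sample(records, key, per_group):
    """Draw up to per_group records from each group so no segment dominates."""
    groups = defaultdict(list)
    for r in records:
        groups[r[key]].append(r)
    sample = []
    for members in groups.values():
        sample.extend(random.sample(members, min(per_group, len(members))))
    return sample

def audit_balance(records, key):
    """Report each group's share of the dataset so skew is visible before training."""
    counts = Counter(r[key] for r in records)
    total = sum(counts.values())
    return {group: round(n / total, 2) for group, n in counts.items()}

random.seed(0)
print(audit_balance(customers, "age_group"))  # skewed: {'18-25': 0.6, '26-40': 0.3, '41+': 0.1}
balanced = stratified_sample(customers, "age_group", per_group=80)
print(audit_balance(balanced, "age_group"))   # roughly equal shares per group
```

Running the audit before and after sampling makes the imbalance, and its correction, explicit; in practice the same check would be repeated across every demographic dimension and rerun as new data arrives.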
Author: LeetQuiz Editorial Team
Which responsible practice should a retail company implement during data collection to reduce bias in a machine learning model for product recommendations?
**A.** Use data from only customers who match the demographics of the company's overall customer base.

**B.** Collect data from customers who have a past purchase history.

**C.** Ensure that the data is balanced and collected from a diverse group.

**D.** Ensure that the data is from a publicly available dataset.