Google Professional Machine Learning Engineer

Ultimate access to all questions.

In the context of preparing a dataset for a machine learning model, you encounter null values in a crucial categorical feature during exploratory data analysis. This issue could potentially introduce bias into your model. Considering the constraints of maintaining data integrity, minimizing bias, and ensuring the model's performance is not adversely affected, what is the optimal strategy to handle these missing values effectively? Choose the best option.

Real Exam

Replace the missing values with the mean of the feature, assuming the categorical data can be numerically encoded.

10.0%

Introduce a special category (e.g., 'Missing' or 'Unknown') to denote missing values, preserving the categorical nature of the feature.

Loading comments...