Databricks Certified Machine Learning - Associate


You are working on a dataset with a numerical feature 'Age' that has some missing values. You have decided to impute these missing values using the mean value. Explain why it is important to add indicator variables for the missing values after imputation, and provide a scenario where this approach would be particularly useful.


Adding indicator variables for missing values after imputation is not necessary, as the missing values have already been handled.


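Adding indicator variables for missing values after imputation is important because mean imputation erases the fact that a value was missing. The indicator preserves that information, so the model can capture any difference between rows where 'Age' was missing and rows where it was observed.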
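
As a rough illustration of the idea, here is a minimal plain-Python sketch of mean imputation with a companion indicator column. The helper name `impute_mean_with_indicator`, the sample ages, and the use of `None` to mark missing entries are all assumptions made for this example; they do not come from the question.

```python
from statistics import mean


def impute_mean_with_indicator(values):
    """Return (imputed_values, missing_flags) for a list that uses None for missing.

    Hypothetical helper for illustration only.
    """
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot compute a mean: every value is missing")
    fill = mean(observed)  # the mean is computed from observed values only
    # Record missingness BEFORE overwriting it, so the information is not lost.
    flags = [1 if v is None else 0 for v in values]
    imputed = [fill if v is None else v for v in values]
    return imputed, flags


ages = [25, None, 40, 31, None]  # hypothetical sample data
age_imputed, age_missing = impute_mean_with_indicator(ages)
```

Here `age_missing` marks the second and fifth rows, which both receive the observed mean in `age_imputed`. One scenario where this could matter is a survey in which people in a particular group tend to skip the age question: the missingness itself is then related to the outcome, and the indicator lets a model use that signal instead of treating imputed rows as ordinary average-age rows.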
