Databricks Certified Machine Learning - Associate

Ultimate access to all questions.

A data scientist is attempting to use Spark ML to impute missing values in their PySpark DataFrame 'features_df'. The goal is to replace missing values in all numeric columns with the median of each respective column. However, the provided code snippet does not achieve this. What is the primary reason the code fails to perform the intended imputation? Choose the most accurate option.

Real Exam

Median value imputation is not supported in Spark ML.

0.0%

The code does not apply imputation to both training and test datasets at the same time.

11.1%

Loading comments...