
Ultimate access to all questions.
A data scientist is attempting to use Spark ML for imputing missing values in a PySpark DataFrame named 'features_df'. The goal is to replace missing values in all numeric columns with their respective median values. However, the provided code snippet fails to achieve this. What is the underlying issue with the code?_
A
The 'inputCols' and 'outputCols' parameters must be identical.
B
The 'fit' method should be used before 'transform' to create an 'ImputerModel'.
C
Median value imputation is not supported in Spark ML.
D
The code does not apply the imputer to both training and test datasets simultaneously.
E
The imputer needs to be configured with a different strategy other than 'median'.