Databricks Certified Machine Learning - Associate


A data scientist is attempting to use Spark ML to impute missing values in a PySpark DataFrame named 'features_df'. The goal is to replace missing values in every numeric column with that column's median. However, the provided code snippet fails to achieve this. What is the underlying issue with the code?


The 'inputCols' and 'outputCols' parameters must be identical.


The 'fit' method should be used before 'transform' to create an 'ImputerModel'.
