
Ultimate access to all questions.
In a Spark MLlib implementation, you are working with a large dataset and need to perform data preprocessing to improve the quality of your machine learning model. Which of the following data preprocessing techniques can be applied in Spark MLlib, and how do they work?
A
Data cleaning, which involves handling missing values, outliers, and errors in the dataset.
B
Data transformation, which involves converting the data into a suitable format for machine learning models, such as normalization or standardization.
C
Feature engineering, which involves creating new features from existing data to improve the model's performance.
D
All of the above, as Spark MLlib supports various data preprocessing techniques.