
Ultimate access to all questions.
In the context of a machine learning project with large-scale datasets, which Databricks MLlib supported technique is most effective for sampling and processing data efficiently for model training?
A
Feature Scaling
B
Stratified Sampling
C
Outlier Detection
D
Data Imputation