Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
In the context of a machine learning project with large-scale datasets, which Databricks MLlib supported technique is most effective for sampling and processing data efficiently for model training?
A
Feature Scaling
B
Stratified Sampling
C
Outlier Detection
D
Data Imputation