Databricks Certified Machine Learning - Associate

Ultimate access to all questions.

When a data scientist integrates a random forest regressor pipeline as the final stage in a Spark ML Pipeline and initiates cross-validation, what is a potential downside of constructing the pipeline within the cross-validation process?

Real Exam

The process might not be able to parallelize tuning because of the pipeline's distributed nature.

4.3%

There's a risk of leaking data preparation details from validation sets to training sets for each model.

26.1%

Loading comments...