LeetQuiz Logo
Privacy Policy•contact@leetquiz.com
© 2025 LeetQuiz All rights reserved.
Databricks Certified Machine Learning - Associate

Databricks Certified Machine Learning - Associate

Get started today

Ultimate access to all questions.


In the context of machine learning pipelines utilizing cross-validation, how should the pipeline and cross-validator be arranged when estimators or transformers are involved?

Real Exam



Explanation:

The arrangement depends on the presence of estimators or transformers in the pipeline. For instance, with estimators like StringIndexer, refitting is necessary each time if the pipeline is placed inside the cross-validator. However, to prevent data leakage from initial steps, it's safer to position the pipeline within the cross-validator. This approach ensures the cross-validator first splits the data before fitting the pipeline, avoiding potential information leakage from the hold-out set to the training set.

Powered ByGPT-5