
Explanation:
Correct answer: C. VectorAssembler
Explanation:
In Databricks and Spark ML, the VectorAssembler is a crucial feature transformer that merges multiple columns of data (be it numerical, boolean, or vector type) into a single vector column. This step is essential for machine learning algorithms in Spark ML, as they typically require each instance's features to be presented as a unified vector. The VectorAssembler efficiently prepares the dataset to meet this requirement.
Why other options are incorrect:
Ultimate access to all questions.
In Databricks, which component is specifically designed to transform a column of scalar values into a column of vector type, a requirement for an estimator's .fit() method? Choose the best answer.
A
VectorScaler
B
VectorConverter
C
VectorAssembler
D
VectorTransformer
No comments yet.