Databricks Certified Machine Learning - Associate

Ultimate access to all questions.

In a Spark MLlib implementation, you are working with a large dataset and need to build a model for a binary classification task. Which of the following algorithms would be most suitable for this task, and why?

Simulated

Linear regression, as it can be used for both regression and classification tasks.

3.7%

Decision trees, as they can handle non-linear relationships and complex interactions between features.

14.1%

Loading comments...

Random forests, as they are an ensemble method that combines multiple decision trees to improve accuracy and robustness.

23.0%