Databricks Certified Machine Learning - Associate

In the context of Spark MLlib, compare and contrast the scalability of linear regression and decision trees. Provide examples of scenarios where one algorithm may be more suitable than the other based on the size and complexity of the dataset.

Linear regression is more scalable than decision trees, as it requires less computational resources and can handle larger datasets.

8.3%

Decision trees are more scalable than linear regression, as they can handle non-linear relationships and complex interactions between features.

13.1%

The scalability of linear regression and decision trees depends on the specific characteristics of the dataset, such as the number of features, the presence of non-linear relationships, and the size of the dataset.

63.7%
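
The point that suitability depends on the shape of the data can be illustrated with a small sketch. This is plain Python, not Spark MLlib: it fits a closed-form least-squares line and a one-split regression tree (a "stump") to two toy datasets, one exactly linear and one a step function. The helper names (`fit_line`, `fit_stump`, `mse`) and the data are assumptions made up for illustration only.

```python
# Plain-Python illustration (not Spark MLlib): compare a least-squares line
# with a depth-1 regression tree on a linear and a non-linear toy dataset.

def fit_line(xs, ys):
    """Closed-form simple linear regression; returns a predictor function."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_stump(xs, ys):
    """Depth-1 regression tree: try each midpoint split, keep the lowest squared error."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold separates identical x values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def mse(predict, xs, ys):
    """Mean squared error of a predictor on the given points."""
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical toy data: one linear target, one step-shaped target.
xs = [float(i) for i in range(20)]
linear_ys = [2.0 * x + 1.0 for x in xs]
step_ys = [0.0 if x < 10 else 5.0 for x in xs]

results = {
    "linear_data": {
        "line": mse(fit_line(xs, linear_ys), xs, linear_ys),
        "stump": mse(fit_stump(xs, linear_ys), xs, linear_ys),
    },
    "step_data": {
        "line": mse(fit_line(xs, step_ys), xs, step_ys),
        "stump": mse(fit_stump(xs, step_ys), xs, step_ys),
    },
}

for dataset, errors in results.items():
    print(dataset, errors)
```

On the linear data the line fits essentially perfectly while the stump leaves a large error. On the step data the roles reverse. The sketch says nothing about cluster-scale cost: the exhaustive threshold search here is not how MLlib trains trees, since MLlib discretizes continuous features into bins (the `maxBins` parameter) before evaluating splits.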