Ultimate access to all questions.
Your team is working on a machine learning project with highly imbalanced classes. Which Spark ML algorithm would you choose to tackle the class imbalance and enhance model performance?
Explanation:
The correct choice is Synthetic Minority Over-sampling Technique (SMOTE). SMOTE is specifically designed to address class imbalance by oversampling the minority class. In Spark ML, SMOTE can be utilized to create synthetic samples for the minority class, thereby balancing the class distribution. This approach significantly improves the model's learning capability from the minority class, making it highly effective for imbalanced datasets in machine learning projects.