
Ultimate access to all questions.
In the context of developing a machine learning model for a retail company to predict customer churn, which of the following steps should be considered the initial and most critical phase in the machine learning pipeline? The project has constraints including a tight budget, the need for scalability to handle millions of customers, and compliance with data privacy regulations. Choose the best option.
A
Model training, as it directly influences the predictive accuracy of the customer churn model.
B
Data pre-processing, to ensure the data is clean and formatted correctly for analysis.
C
Model evaluation, to select the best performing model based on predefined metrics.
D
Data collection, to gather comprehensive and compliant customer data necessary for any subsequent analysis.
E
Both A and C are equally important initial steps.