
Ultimate access to all questions.
In the context of a machine learning project aimed at predicting customer churn for a telecommunications company, the data science team is in the process of preparing their dataset for model training. The dataset includes customer demographics, service usage patterns, customer service interactions, and billing information collected over the past two years. Given the project's goal to accurately identify at-risk customers, why is evaluating the quality of this dataset critical? Choose the two most important reasons.
A
To minimize the storage costs associated with the dataset
B
To ensure the dataset's features are computationally efficient for model training
C
To verify the dataset's reliability and appropriateness for building predictive models
D
To enhance the visual appeal of the data visualization dashboard
E
To identify and rectify data inconsistencies, missing values, and outliers that could bias the model's predictions