
Ultimate access to all questions.
In the context of preparing data for a machine learning model, a team is evaluating the importance of data quality assessment. The dataset includes customer transactions from multiple sources with varying formats, missing values, and potential outliers. The team aims to build a model that predicts customer churn with high accuracy. Given the scenario, why is assessing data quality a critical step in machine learning data preparation? (Choose one correct option)
A
It ensures that all data sources are utilized without exception, maximizing the dataset's size.
B
It introduces unnecessary complexity into the dataset, making the model harder to interpret.
C
It aids in the detection and correction of data anomalies and errors, ensuring the model's predictions are based on reliable data.
D
It significantly decreases the necessity for data storage, reducing costs associated with large datasets.