Ultimate access to all questions.
In the context of developing a machine learning model for a financial services company, the team encounters several data quality issues including missing values, outliers, and inconsistent data entries. Considering the constraints of regulatory compliance, cost efficiency, and the need for scalable solutions, what are the two most significant impacts of these data quality issues on the model's performance and deployment? Choose two correct options.
Explanation:
Correct Options: B and C. They can lead to inaccurate and biased predictions, and they can enhance the model's complexity unnecessarily.
Explanation: Data quality issues such as missing values, outliers, and inconsistent data entries can have profound impacts on a machine learning model's performance and its deployment in a regulated industry like financial services. Poor data quality can lead to biased models that make inaccurate predictions, which is particularly problematic in financial services where regulatory compliance and accuracy are paramount. Additionally, attempting to compensate for data quality issues can unnecessarily increase the model's complexity, making it more difficult to interpret and more costly to run, without necessarily improving its performance.
Why other options are incorrect: