
Ultimate access to all questions.
In the context of automating a machine learning pipeline for a financial services company, which phase is critical for ensuring the quality and relevance of data before model training? The company emphasizes cost-efficiency, compliance with financial regulations, and scalability to handle large datasets. Choose the best option that describes the phase primarily concerned with data pre-processing and feature engineering, considering the given constraints.
A
Model evaluation, as it ensures the model meets regulatory compliance before deployment.
B
Model deployment, focusing on scalable infrastructure to handle production loads.
C
Data collection, ensuring all financial data is gathered in compliance with regulations.
D
Data pre-processing, where data is cleaned, transformed, and features are engineered to meet quality and compliance standards efficiently.
E
Both C and D, as data collection and pre-processing are equally important for compliance and quality.