
Answer-first summary for fast verification
Answer: Data pre-processing, where data is cleaned, transformed, and features are engineered to meet quality and compliance standards efficiently., Data collection, ensuring all financial data is gathered in compliance with regulations.
Correct Answer: D. Data pre-processing Explanation: Data pre-processing is a critical phase in the ML pipeline, especially in a regulated industry like financial services. This phase includes data cleaning to address missing values and outliers, data transformation for normalization and standardization, and feature engineering to extract valuable insights from raw data. Automating these tasks ensures data integrity, compliance with financial regulations, and cost-efficiency by reducing manual errors and saving time. Scalability is also addressed by automating the handling of large datasets. Incorrect Options: - **A. Model evaluation**: While important, this phase focuses on assessing model performance, not data quality or compliance. - **B. Model deployment**: This phase involves placing the model into production, not preparing the data. - **C. Data collection**: Although crucial, this phase is about gathering data, not ensuring its quality or relevance for model training. - **E. Both C and D**: While both are important, data pre-processing is the phase primarily concerned with ensuring data quality and relevance for model training, making D the best single answer.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
In the context of automating a machine learning pipeline for a financial services company, which phase is critical for ensuring the quality and relevance of data before model training? The company emphasizes cost-efficiency, compliance with financial regulations, and scalability to handle large datasets. Choose the best option that describes the phase primarily concerned with data pre-processing and feature engineering, considering the given constraints.
A
Model evaluation, as it ensures the model meets regulatory compliance before deployment.
B
Model deployment, focusing on scalable infrastructure to handle production loads.
C
Data collection, ensuring all financial data is gathered in compliance with regulations.
D
Data pre-processing, where data is cleaned, transformed, and features are engineered to meet quality and compliance standards efficiently.
E
Both C and D, as data collection and pre-processing are equally important for compliance and quality.