
Ultimate access to all questions.
You are designing an automated Machine Learning (ML) pipeline for a financial services company that processes large volumes of transactional data daily. The pipeline must efficiently handle data ingestion, preprocessing, model training, evaluation, and deployment, with a strong emphasis on reliability and scalability. Given the complexity of the pipeline, which phase is primarily responsible for the scheduling and triggering of pipeline executions to ensure seamless operation across all stages? Choose the best option.
A
Data collection, as it initiates the pipeline by gathering the necessary data for processing.
B
Data preprocessing, where data is cleaned and transformed before model training.
C
Model deployment, which involves making the trained model available for predictions.
D
Orchestration and scheduling, which manages the workflow and triggers each phase of the pipeline at the correct time.
E
None of the above, as pipeline scheduling is handled by an external system not part of the ML pipeline.