
Answer-first summary for fast verification
Answer: Integrate data lineage tracking for all data sources and track the entire data transformation pipeline.
Integrating data lineage tracking for all data sources and tracking the entire data transformation pipeline ensures comprehensive visibility into the data's journey from source to model. This approach helps in understanding the impact of data changes on the models and facilitates better data governance.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are tasked with establishing data lineage for machine learning models using Amazon SageMaker ML Lineage Tracking. The models are trained on data from various sources, including Amazon S3, Amazon Redshift, and on-premises databases. What steps should be taken to ensure comprehensive data lineage tracking?
A
Track only the final model output and ignore intermediate data transformations.
B
Track data lineage for each data source independently without integrating them.
C
Integrate data lineage tracking for all data sources and track the entire data transformation pipeline.
D
Track data lineage only for the most critical data sources.
No comments yet.