
Answer-first summary for fast verification
Answer: Integrate data lineage tracking for all data sources and track the entire data transformation pipeline.
Integrating data lineage tracking for all data sources and tracking the entire data transformation pipeline ensures comprehensive visibility into the data's journey from source to processed output. This approach helps in understanding the impact of data changes on the processing pipelines and facilitates better data governance.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
You are tasked with establishing data lineage for data processing pipelines using AWS Glue. The pipelines process data from various sources, including Amazon S3, Amazon Redshift, and on-premises databases. What steps should be taken to ensure comprehensive data lineage tracking?
A
Track only the final processed output and ignore intermediate data transformations.
B
Track data lineage for each data source independently without integrating them.
C
Integrate data lineage tracking for all data sources and track the entire data transformation pipeline.
D
Track data lineage only for the most critical data sources.
No comments yet.