AWS Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

Explanation:

Integrating data lineage tracking for all data sources and tracking the entire data transformation pipeline ensures comprehensive visibility into the data's journey from source to processed output. This approach helps in understanding the impact of data changes on the processing pipelines and facilitates better data governance.

Explanation:

Comments (0)

No comments yet.

You are tasked with establishing data lineage for data processing pipelines using AWS Glue. The pipelines process data from various sources, including Amazon S3, Amazon Redshift, and on-premises databases. What steps should be taken to ensure comprehensive data lineage tracking?

Simulated

Track only the final processed output and ignore intermediate data transformations.

9.1%

Track data lineage for each data source independently without integrating them.

9.1%

Integrate data lineage tracking for all data sources and track the entire data transformation pipeline.

72.7%

Track data lineage only for the most critical data sources.

9.1%