AWS Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

You are responsible for designing a data pipeline that ingests data from multiple sources into an AWS data lake. One of the sources is a real-time streaming data source. How can you ensure that the data is ingested into the data lake in real-time, and the metadata is updated in the AWS Glue Data Catalog?

Simulated

Last updated: February 5, 2026 at 14:03

Use AWS Glue crawlers to periodically scan the streaming data source and update the data catalog.

16.7%

Implement an AWS Lambda function that triggers on new data in the streaming data source and updates the data catalog using the AWS Glue API.

12.5%

Comments

Loading comments...

Create an AWS Glue job that continuously monitors the streaming data source and ingests data into the data lake, updating the data catalog as needed.

8.3%

Leverage AWS Kinesis Data Streams to capture and process the real-time streaming data, and use AWS Kinesis Data Firehose to load the data into the data lake and update the data catalog.

62.5%