
Ultimate access to all questions.
You are responsible for designing a data pipeline that ingests data from multiple sources into an AWS data lake. One of the sources is a real-time streaming data source. How can you ensure that the data is ingested into the data lake in real-time, and the metadata is updated in the AWS Glue Data Catalog?
A
Use AWS Glue crawlers to periodically scan the streaming data source and update the data catalog.
B
Implement an AWS Lambda function that triggers on new data in the streaming data source and updates the data catalog using the AWS Glue API.
C
Create an AWS Glue job that continuously monitors the streaming data source and ingests data into the data lake, updating the data catalog as needed.
D
Leverage AWS Kinesis Data Streams to capture and process the real-time streaming data, and use AWS Kinesis Data Firehose to load the data into the data lake and update the data catalog.