AWS Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

You are tasked with designing an ETL pipeline for a large e-commerce company that needs to process millions of transactions daily. The pipeline should be able to handle high throughput and low latency. Which AWS services would you integrate to create this pipeline, and how would you configure them to meet the requirements?

Simulated

Use AWS Glue for ETL processing, AWS Lambda for serverless computing, and Amazon S3 for data storage. Configure Glue to run on a schedule and Lambda to trigger on new data arrival.

10.0%

Use Amazon Kinesis for real-time data streaming, AWS Glue for ETL processing, and Amazon Redshift for data warehousing. Configure Kinesis to capture data in real-time, Glue to process data as it arrives, and Redshift to store and analyze the processed data.

Comments

Loading comments...

Use AWS Data Pipeline for scheduling and workflow management, AWS Lambda for serverless computing, and Amazon DynamoDB for data storage. Configure Data Pipeline to schedule Lambda functions based on dependencies, and Lambda to process data stored in DynamoDB.

Use Amazon S3 for data storage, AWS Step Functions for workflow management, and AWS Glue for ETL processing. Configure Step Functions to manage the workflow, Glue to process data stored in S3, and trigger the workflow based on a schedule.

20.0%