
A company wants to ingest customer payment data into the company's data lake in Amazon S3. The company receives payment data every minute on average. The company wants to analyze the payment data in real time. Then the company wants to ingest the data into the data lake.
Which solution will meet these requirements with the MOST operational efficiency?
A. Use Amazon Kinesis Data Streams to ingest data. Use AWS Lambda to analyze the data in real time.
B. Use AWS Glue to ingest data. Use Amazon Kinesis Data Analytics to analyze the data in real time.
C. Use Amazon Kinesis Data Firehose to ingest data. Use Amazon Kinesis Data Analytics to analyze the data in real time.
D. Use Amazon API Gateway to ingest data. Use AWS Lambda to analyze the data in real time.
Explanation:
Correct Answer: C
Amazon Kinesis Data Firehose combined with Amazon Kinesis Data Analytics is the most operationally efficient solution for this scenario because:
Near-real-time ingestion: Kinesis Data Firehose ingests streaming data continuously, buffers it briefly, and delivers it in near real time, which comfortably handles payment data that arrives about once per minute.
Built-in data lake integration: Kinesis Data Firehose has native integration with Amazon S3 for data lake storage, automatically delivering data to S3 without requiring additional services or custom code.
Real-time analytics: Amazon Kinesis Data Analytics processes streaming data in real time and can read directly from a Firehose delivery stream as its streaming source, so payment data is analyzed as it arrives and before it lands in S3.
Operational efficiency: Kinesis Data Firehose is a fully managed service that handles data delivery, buffering, compression, and encryption automatically, minimizing operational overhead.
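To make the points above concrete, here is a minimal Python sketch of what the producer side of option C might look like. It is not part of the original question. The stream name, S3 prefix, buffering values, and payment fields are hypothetical. The client is passed in as a parameter, assumed to be a boto3 Firehose client, and the `put_record` call and the `ExtendedS3DestinationConfiguration` keys follow the boto3 Firehose API.

```python
import json


def build_s3_destination(bucket_arn, role_arn):
    """Build the S3 destination settings for a Firehose delivery stream.

    The prefix and buffering values are illustrative assumptions,
    not values taken from the question.
    """
    return {
        "BucketARN": bucket_arn,
        "RoleARN": role_arn,
        "Prefix": "payments/",  # hypothetical data lake prefix
        "BufferingHints": {"SizeInMBs": 5, "IntervalInSeconds": 60},
        "CompressionFormat": "GZIP",
    }


def encode_payment(payment):
    """Serialize one payment as a newline-terminated JSON record."""
    return (json.dumps(payment, sort_keys=True) + "\n").encode("utf-8")


def send_payment(firehose_client, stream_name, payment):
    """Send one payment to the delivery stream; Firehose handles delivery to S3."""
    return firehose_client.put_record(
        DeliveryStreamName=stream_name,
        Record={"Data": encode_payment(payment)},
    )


# Possible usage, assuming boto3 is installed and credentials are configured:
#   import boto3
#   client = boto3.client("firehose")
#   send_payment(client, "payments-stream", {"payment_id": "p-1", "amount": 12.5})
```

The producer only calls `put_record`. Buffering, compression, and the write to S3 are configured on the delivery stream, not coded by hand, which is the operational advantage the explanation describes.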
Why other options are less optimal:
Option A: While Kinesis Data Streams + Lambda can work, it requires more operational management. Lambda functions need to be written and maintained to process data and write to S3, adding operational complexity.
Option B: AWS Glue is primarily a batch ETL service, not designed for real-time ingestion. It's not suitable for minute-by-minute data ingestion requirements.
Option D: API Gateway is designed for API management, not for continuous data ingestion. It would require additional services and custom code to handle streaming data delivery to S3.
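For contrast, the sketch below illustrates the kind of custom code option A would leave the team to write and maintain. It is an assumption-laden example, not a reference implementation. The bucket, key, `amount` field, and the summing "analysis" are hypothetical placeholders. The S3 client is passed in as a parameter and is assumed to be a boto3 S3 client. The event shape, with base64-encoded payloads under `Records[].kinesis.data`, is the format Lambda receives from Kinesis Data Streams.

```python
import base64
import json


def decode_kinesis_records(event):
    """Decode the base64 JSON payloads in a Kinesis Data Streams Lambda event."""
    return [
        json.loads(base64.b64decode(record["kinesis"]["data"]))
        for record in event["Records"]
    ]


def handle_batch(event, s3_client, bucket, key):
    """Analyze a batch of payments, then write the batch to S3 by hand."""
    payments = decode_kinesis_records(event)
    # Placeholder analysis: total the (hypothetical) "amount" field.
    total = sum(p["amount"] for p in payments)
    # With option A, the S3 write is custom code the team owns.
    body = "\n".join(json.dumps(p) for p in payments).encode("utf-8")
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)
    return {"count": len(payments), "total": total}
```

Even this small handler leaves batching, retries, object naming, and error handling unaddressed. Firehose covers those concerns as part of the managed service.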
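Key AWS Services:
Amazon Kinesis Data Firehose: ingests the payment stream and delivers it to Amazon S3.
Amazon Kinesis Data Analytics: analyzes the streaming data in real time.
Amazon S3: stores the data as the data lake.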
This combination provides the most operational efficiency by leveraging fully managed services that work seamlessly together for real-time streaming data ingestion, processing, and storage.