
Answer-first summary for fast verification
Answer: Use Amazon Athena for batch processing and AWS Lambda for real-time processing.
For batch processing, Amazon Athena is a suitable choice because it is serverless: you query data in S3 with standard SQL, with no servers to set up or manage. For real-time processing, AWS Lambda runs code in response to events, such as a new object being uploaded to S3, so each file can be checked and cleaned as it arrives. Together, the two services cover batch validation of data already in S3 and per-event cleaning of incoming data.
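To make the real-time half concrete, here is a minimal sketch of a Lambda handler that cleans a CSV file when S3 reports a new object. It is an illustration under stated assumptions, not a reference implementation: the `id` and `email` columns, the `clean/` output prefix, and the cleaning rules (trim whitespace, drop rows missing required fields, drop duplicate ids) are all hypothetical. It also assumes the S3 trigger is filtered to an input prefix such as `raw/`, so that writing the cleaned file does not re-invoke the function.

```python
import csv
import io
from urllib.parse import unquote_plus

# Hypothetical schema: columns every valid row must fill in.
REQUIRED_FIELDS = ("id", "email")


def clean_csv(text: str) -> str:
    """Trim whitespace, drop incomplete or malformed rows, drop duplicate ids."""
    reader = csv.DictReader(io.StringIO(text))
    if not reader.fieldnames:
        return ""  # empty input: nothing to clean
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    seen_ids = set()
    for row in reader:
        if None in row:
            continue  # row has more cells than the header; treat as malformed
        row = {key: (value or "").strip() for key, value in row.items()}
        if any(not row.get(field) for field in REQUIRED_FIELDS):
            continue  # a required value is missing
        if row["id"] in seen_ids:
            continue  # duplicate record
        seen_ids.add(row["id"])
        writer.writerow(row)
    return out.getvalue()


def lambda_handler(event, context, s3=None):
    """Entry point for an S3 object-created notification.

    The optional s3 argument exists only so the handler can be exercised
    with a stand-in client; Lambda itself passes just event and context.
    """
    if s3 is None:
        import boto3  # available in the Lambda Python runtime
        s3 = boto3.client("s3")
    written = []
    for record in event["Records"]:
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        body = s3.get_object(Bucket=bucket, Key=key)["Body"].read().decode("utf-8")
        # Hypothetical layout: raw/<name> is cleaned into clean/<name>.
        out_key = "clean/" + key.split("/", 1)[-1]
        s3.put_object(Bucket=bucket, Key=out_key, Body=clean_csv(body).encode("utf-8"))
        written.append(out_key)
    return {"cleaned": written}
```

Keeping the cleaning rules in a plain function, separate from the S3 calls, lets them be checked locally without AWS credentials. The batch half of the answer would instead express similar rules as SQL run through Athena over the files already in S3.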
Author: LeetQuiz Editorial Team
Given a scenario where you need to verify and clean a dataset stored in S3 before performing any analysis, which AWS services would you use to automate the data cleaning process and ensure data quality? Consider the need for both batch and real-time processing.
A. Use AWS Glue for batch processing and Amazon Kinesis for real-time processing.
B. Use Amazon Athena for batch processing and AWS Lambda for real-time processing.
C. Use Amazon SageMaker Data Wrangler for batch processing and Amazon QuickSight for real-time processing.
D. Use AWS Lambda for batch processing and Amazon Athena for real-time processing.