AWS Certified Solutions Architect - Associate

Explanation:

Correct Answers: B and E

Why B is correct:

Amazon S3 provides highly scalable, durable, and cost-effective storage for large volumes of documents
Amazon Athena allows running SQL queries directly on data stored in S3 without needing to manage infrastructure
This combination provides serverless, scalable querying capability that meets the requirement to "run SQL queries on the data"
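
As a rough illustration of the S3 plus Athena pattern, the sketch below submits a SQL statement and polls for the result. It assumes an Athena table has already been defined over the S3 data; the function name `run_athena_query`, the database name, and the bucket paths are hypothetical. The client is passed in so the function can be exercised without AWS credentials.

```python
import time


def run_athena_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Run a SQL statement in Athena and return the first page of rows.

    `athena` is expected to be a boto3 Athena client, e.g. boto3.client("athena").
    `database` and `output_location` are placeholders chosen by the caller.
    """
    start = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = start["QueryExecutionId"]

    # Athena executes queries asynchronously, so poll until a terminal state.
    while True:
        status = athena.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query {query_id} ended in state {state}")

    # Only the first page of results is read here; larger result sets
    # would need pagination with NextToken.
    results = athena.get_query_results(QueryExecutionId=query_id)
    return [
        [col.get("VarCharValue") for col in row["Data"]]
        for row in results["ResultSet"]["Rows"]
    ]
```

A call might look like `run_athena_query(boto3.client("athena"), "SELECT * FROM documents LIMIT 10", "medical_docs", "s3://example-athena-results/")`, where the table, database, and bucket names are made up for this example. For a SELECT statement, the first returned row typically holds the column headers.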

Why E is correct:

AWS Lambda provides serverless, event-driven processing that automatically scales with the volume of documents
Amazon Textract is specifically designed for extracting text and data from scanned documents (not Amazon Rekognition, which is for image/video analysis)
Amazon Comprehend Medical is specifically designed to extract medical information from text (not Amazon Transcribe Medical, which is for speech-to-text)
This combination maximizes scalability and operational efficiency by using serverless services
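
The Lambda, Textract, and Comprehend Medical flow described above could be sketched roughly as follows. This is a minimal sketch, not a production design: it assumes the function is triggered by an S3 upload event, that each document is small enough for Textract's synchronous `detect_document_text` call and for a single `detect_entities_v2` request, and that results go to a bucket named in a hypothetical `OUTPUT_BUCKET` environment variable. The helper name `process_document` and the `extracted/` key prefix are also illustrative.

```python
import json
import os
from urllib.parse import unquote_plus


def process_document(textract, comprehend_medical, s3, bucket, key, output_bucket):
    """Extract text from one scanned document, detect medical entities,
    and write the result as JSON to `output_bucket`. Returns the output key."""
    response = textract.detect_document_text(
        Document={"S3Object": {"Bucket": bucket, "Name": key}}
    )
    # Keep only LINE blocks; WORD and PAGE blocks are skipped.
    lines = [b["Text"] for b in response["Blocks"] if b["BlockType"] == "LINE"]
    text = "\n".join(lines)

    # Comprehend Medical rejects empty input, so skip the call for blank pages.
    entities = []
    if text:
        entities = comprehend_medical.detect_entities_v2(Text=text)["Entities"]

    record = {
        "source_key": key,
        "entities": [
            {"text": e["Text"], "category": e["Category"], "type": e["Type"]}
            for e in entities
        ],
    }
    out_key = f"extracted/{key}.json"  # hypothetical key layout
    s3.put_object(
        Bucket=output_bucket,
        Key=out_key,
        Body=json.dumps(record).encode("utf-8"),
    )
    return out_key


def lambda_handler(event, context):
    """Entry point for an S3-triggered Lambda function."""
    import boto3  # provided by the Lambda Python runtime

    textract = boto3.client("textract")
    comprehend_medical = boto3.client("comprehendmedical")
    s3 = boto3.client("s3")
    output_bucket = os.environ["OUTPUT_BUCKET"]  # assumed configuration

    written = []
    for rec in event["Records"]:
        bucket = rec["s3"]["bucket"]["name"]
        key = unquote_plus(rec["s3"]["object"]["key"])  # keys arrive URL-encoded
        written.append(
            process_document(textract, comprehend_medical, s3, bucket, key, output_bucket)
        )
    return {"written": written}
```

Passing the clients into `process_document` keeps the extraction logic testable without AWS access. The JSON objects written to S3 are what Athena would then query.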

Why other options are incorrect:

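A (EC2 with MySQL):

Running MySQL on an EC2 instance means provisioning, patching, and scaling the server yourself, which works against both scalability and operational efficiency

C (Auto Scaling EC2 with custom application):

A custom application on Auto Scaling EC2 instances still has to be built and maintained, and the instances still have to be managed, so it carries more operational overhead than managed serverless services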

D (Lambda with Rekognition and Transcribe Medical):

Amazon Rekognition is for image/video analysis, not document text extraction
Amazon Transcribe Medical is for converting speech to text, not processing written documents
Wrong combination of services for the use case

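Key AWS Services Used:

Amazon S3 (document and result storage)
Amazon Athena (SQL queries on data in S3)
AWS Lambda (event-driven processing)
Amazon Textract (text extraction from scanned documents)
Amazon Comprehend Medical (medical information extraction from text)

This solution provides maximum scalability (serverless services scale automatically) and operational efficiency (no infrastructure to manage).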

Last updated: May 9, 2026 at 14:02

A. Write the document information to an Amazon EC2 instance that runs a MySQL database.

B. Write the document information to an Amazon S3 bucket. Use Amazon Athena to query the data.