
Ultimate access to all questions.
Deep dive into the quiz with AI chat providers.
We prepare a focused prompt with your quiz and certificate details so each AI can offer a more tailored, in-depth explanation.
A company is using a fleet of Amazon EC2 instances to ingest data from on-premises data sources. The data is in JSON format and ingestion rates can be as high as 1 MB/s. When an EC2 instance is rebooted, the data in-flight is lost. The company's data science team wants to query ingested data in near-real time.
Which solution provides near-real-time data querying that is scalable with minimal data loss?
A
Publish data to Amazon Kinesis Data Streams, Use Kinesis Data Analytics to query the data.
B
Publish data to Amazon Kinesis Data Firehose with Amazon Redshift as the destination. Use Amazon Redshift to query the data.
C
Store ingested data in an EC2 instance store. Publish data to Amazon Kinesis Data Firehose with Amazon S3 as the destination. Use Amazon Athena to query the data.
D
Store ingested data in an Amazon Elastic Block Store (Amazon EBS) volume. Publish data to Amazon ElastiCache for Redis. Subscribe to the Redis channel to query the data.
Explanation:
Correct Answer: A
Why Option A is correct:
Why other options are incorrect:
Option B (Kinesis Data Firehose with Redshift):
Option C (EC2 instance store + Firehose + S3 + Athena):
Option D (EBS + ElastiCache for Redis):
Key AWS Services Understanding:
This solution addresses all requirements: scalability, minimal data loss, and near-real-time querying capabilities.