
Answer-first summary for fast verification
Answer: Use Amazon Athena Federated Query to join the data from all data sources.
Amazon Athena Federated Query enables analysts to run SQL queries across data stored in relational, non-relational, object, and custom data sources without moving the data. This makes it the most cost-effective and least complex option for a one-time analysis job since there is no need to copy or move data beforehand.
Author: Ritesh Yadav
Ultimate access to all questions.
A data engineer needs to join data from multiple sources to perform a one-time analysis job. The data is stored in Amazon DynamoDB, Amazon RDS, Amazon Redshift, and Amazon S3. Which solution will meet this requirement MOST cost-effectively?
A
Use an Amazon EMR provisioned cluster to read from all sources. Use Apache Spark to join the data and perform the analysis.
B
Copy the data from DynamoDB, Amazon RDS, and Amazon Redshift into Amazon S3. Run Amazon Athena queries directly on the S3 files.
C
Use Amazon Athena Federated Query to join the data from all data sources.
D
Use Redshift Spectrum to query data from DynamoDB, Amazon RDS, and Amazon S3 directly from Redshift.
No comments yet.