
Explanation:
Amazon Athena's query result reuse feature allows Athena to use the cached results of a previous query if a new query is submitted within a specified time window (up to 7 days). Since the data is only updated once a day and the BI application refreshes every 1 hour, reusing query results will drastically reduce costs and execution time by not scanning S3 repeatedly for the same query, without adding new infrastructure like ElastiCache.
Ultimate access to all questions.
Question 18
A financial company wants to use Amazon Athena to run on-demand SQL queries on a petabyte-scale dataset to support a business intelligence (BI) application. An AWS Glue job that runs during non-business hours updates the dataset once every day. The BI application has a standard data refresh frequency of 1 hour to comply with company policies. A data engineer wants to cost optimize the company's use of Amazon Athena without adding any additional infrastructure costs. Which solution will meet these requirements with the LEAST operational overhead?
A
Configure an Amazon S3 Lifecycle policy to move data to the S3 Glacier Deep Archive storage class after 1 day.
B
Use the query result reuse feature of Amazon Athena for the SQL queries.
C
Add an Amazon ElastiCache cluster between the BI application and Athena.
D
Change the format of the files that are in the dataset to Apache Parquet.
No comments yet.