
Answer-first summary for fast verification
Answer: Apache Parquet format compressed with Snappy
Option C is CORRECT because using Apache Parquet format compressed with Snappy is an optimal solution for Amazon Athena queries. Parquet is a columnar storage file format that is highly efficient for query performance, especially for large datasets. It significantly reduces the amount of data scanned during queries, which can greatly improve query runtime and reduce costs. Snappy compression is efficient and fast, providing a good balance between compression ratio and decompression speed.
Author: Ritesh Yadav
Ultimate access to all questions.
Question 37/60
A company stores 10 to 15 TB of uncompressed .csv files in Amazon S3. The company is evaluating Amazon Athena as a one-time query engine.
The company wants to transform the data to optimize query runtime and storage costs.
Which file format and compression solution will meet these requirements for Athena queries?
A
.csv format compressed with zip
B
JSON format compressed with bzip2
C
Apache Parquet format compressed with Snappy
D
Apache Avro format compressed with LZO
No comments yet.