A company receives daily .csv file deliveries in an Amazon S3 bucket and needs to convert them to Apache Parquet format, with minimal development effort, for storage in a separate data bucket. Which solution is the most efficient?
A
Set up an Amazon EMR cluster with Spark for data transformation and use EMRFS to store results in the target bucket.
B
Utilize AWS Glue for data discovery and ETL operations, directing output to the specified data bucket.
C
Employ AWS Batch with a Bash job definition for the transformation, submitted as an array job that writes its output to the target bucket.
D
Develop an AWS Lambda function to handle the conversion and configure S3 event notifications to trigger it automatically.