
Answer-first summary for fast verification
Answer: Create a manifest file that contains the data file locations. Use a COPY command to load the data into Amazon Redshift.
You can use a manifest file with the Amazon Redshift COPY command to ensure that the load process loads all required files, and loads them in parallel, which maximizes data ingestion speed. It does not introduce extra services, so it meets the requirement of not increasing costs.
Author: Ritesh Yadav
Ultimate access to all questions.
Question 32
The company loads all the data files into one table in the Redshift cluster by using a separate COPY command for each data file location. This approach takes a long time to load all the data files into the table. The company must increase the speed of the data ingestion. The company does not want to increase the cost of the process. Which solution will meet these requirements?
A
Use a provisioned Amazon EMR cluster to copy all the data files into one folder. Use a COPY command to load the data into Amazon Redshift.
B
Load all the data files in parallel into Amazon Aurora. Run an AWS Glue job to load the data into Amazon Redshift.
C
Use an AWS Glue job to copy all the data files into one folder. Use a COPY command to load the data into Amazon Redshift.
D
Create a manifest file that contains the data file locations. Use a COPY command to load the data into Amazon Redshift.
No comments yet.