
Answer-first summary for fast verification
Answer: Ensure that the object format, compression type, and schema are the same for each object., Ensure that the structure of the prefix for each S3 object name is consistent.
Option A is CORRECT because ensuring that the object format, compression type, and schema are consistent for each object allows the AWS Glue crawler to recognize all objects as belonging to a single dataset. This uniformity is crucial because the crawler uses these properties to infer the structure of the data. When the objects have the same format, compression type, and schema, the crawler can combine the data into a single table instead of creating multiple tables. Option D is CORRECT because maintaining a consistent structure of the prefix for each S3 object name helps the AWS Glue crawler understand that the objects belong to the same logical group or dataset. A uniform prefix structure in the S3 paths helps the crawler recognize the objects as part of the same dataset, which in turn leads to the creation of a single table in the data catalog.
Author: Ritesh Yadav
Ultimate access to all questions.
Question 20/58
A finance company receives data from third-party data providers and stores the data as objects in an Amazon S3 bucket.
The company ran an AWS Glue crawler on the objects to create a data catalog. The AWS Glue crawler created multiple tables. However, the company expected that the crawler would create only one table.
The company needs a solution that will ensure the AWS Glue crawler creates only one table.
Which combination of solutions will meet this requirement? (Choose two.)
A
Ensure that the object format, compression type, and schema are the same for each object.
B
Ensure that the object format and schema are the same for each object. Do not enforce consistency for the compression type of each object.
C
Ensure that the schema is the same for each object. Do not enforce consistency for the file format and compression type of each object.
D
Ensure that the structure of the prefix for each S3 object name is consistent.
E
Ensure that all S3 object names follow a similar pattern.
No comments yet.