
Answer-first summary for fast verification
Answer: Use AWS Glue crawlers to periodically scan the data sources and update the data catalog.
To ensure that the AWS Glue Data Catalog is properly maintained and up-to-date with the latest data sources and schemas, you should use AWS Glue crawlers to periodically scan the data sources and update the data catalog. This automates the process and reduces the risk of manual errors. While manually updating the catalog or creating a custom script may work, they are not as efficient or reliable as using AWS Glue crawlers. The built-in versioning and CDC features in the AWS Glue Data Catalog are useful for tracking changes, but they do not automatically update the catalog.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
Your company is using AWS Glue to build a data lake. You need to ensure that the data catalog is properly maintained and up-to-date with the latest data sources and schemas. What best practices should you follow to achieve this?
A
Manually update the data catalog whenever a new data source or schema is added.
B
Use AWS Glue crawlers to periodically scan the data sources and update the data catalog.
C
Create a custom script that monitors the data sources and updates the data catalog when changes are detected.
D
Leverage the built-in versioning and change data capture (CDC) features in the AWS Glue Data Catalog to automatically track and update the catalog.
No comments yet.