AWS Certified Data Engineer - Associate

Ultimate access to all questions.

Your company is using AWS Glue to process and analyze data from various sources. You need to ensure that the data catalog is scalable and can handle an increasing volume of data sources and schemas. What best practices should you follow to achieve this?

Simulated

Periodically review and optimize the data catalog structure to accommodate the growing volume of data sources and schemas.

5.9%

Implement data partitioning and indexing strategies in the data catalog to improve query performance and scalability.

Loading comments...

Use AWS Glue crawlers to automatically discover and catalog new data sources and schemas as they are added.

35.3%

Leverage AWS Lake Formation to manage and govern the data catalog, ensuring scalability and efficient access control.

14.7%