
Answer-first summary for fast verification
Answer: Enable job bookmarks for the ETL jobs to update the state after a run to keep track of previously processed data.
AWS Glue Job Bookmarks help Glue maintain state information and prevent reprocessing of old data. This allows jobs to process only the incremental data without requiring custom state tracking.
Author: Ritesh Yadav
Ultimate access to all questions.
Question 26\n\nA company has developed several AWS Glue extract, transform, and load (ETL) jobs to validate and transform data from Amazon S3. The ETL jobs load the data into Amazon RDS for MySQL in batches once every day. The ETL jobs use a DynamicFrame to read the S3 data. The ETL jobs currently process all the data that is in the S3 bucket. However, the company wants the jobs to process only the daily incremental data. Which solution will meet this requirement with the LEAST coding effort?
A
Create an ETL job that reads the S3 file status and logs the status in Amazon DynamoDB.
B
Enable job bookmarks for the ETL jobs to update the state after a run to keep track of previously processed data.
C
Enable job metrics for the ETL jobs to help keep track of processed objects in Amazon CloudWatch.
D
Configure the ETL jobs to delete processed objects from Amazon S3 after each run.
No comments yet.