
You are working on a data pipeline that processes customer feedback records from a marketing company. The records contain information about customer satisfaction and preferences. You have been tasked with ensuring the quality of this dataset. Describe the steps you would take to run data quality checks on the customer feedback records dataset, and explain how you would define data quality rules to identify and filter out irrelevant or duplicate feedback records.
A
Run data quality checks by manually inspecting each customer feedback record and identifying irrelevant or duplicate records.
B
Use AWS Glue to run data quality checks by writing custom scripts that identify irrelevant or duplicate records based on specific criteria.
C
Define data quality rules using AWS Glue DataBrew by creating a new project, selecting the customer feedback records dataset, and specifying rules to identify and filter out irrelevant or duplicate records.
D
Ignore data quality checks and assume all customer feedback records are relevant and unique.
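The rule-based filtering described in options B and C can be sketched in a few lines. The following is a minimal illustration only, assuming the feedback records are loaded into a pandas DataFrame; the column names (customer_id, feedback_text, rating) and the "empty text means irrelevant" rule are hypothetical choices for the example, not part of the question:

```python
import pandas as pd

# Hypothetical feedback records; column names are assumptions for illustration.
records = pd.DataFrame({
    "customer_id": [1, 1, 2, 3],
    "feedback_text": ["Great service", "Great service", "", "Too slow"],
    "rating": [5, 5, 3, 2],
})

# Rule 1: drop exact duplicate records (same customer, text, and rating).
deduped = records.drop_duplicates()

# Rule 2: drop irrelevant records, defined here as rows with empty feedback text.
clean = deduped[deduped["feedback_text"].str.strip() != ""]

print(len(clean))  # → 2 (one duplicate row and one empty row removed)
```

In a managed setup, the same two rules would be expressed declaratively (for example, as DataBrew recipe steps or Glue data quality rules) rather than hand-written, which is what distinguishes option C from option B.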