AWS Certified Data Engineer - Associate

Your company is planning to implement a data lake architecture to store and process large volumes of diverse data. Describe the steps you would take to design and implement an ETL pipeline for a data lake, and explain the considerations involved in handling the data in its native format.

Use a traditional data warehouse architecture and store the data in a structured format, as it is more suitable for handling large volumes of diverse data.

0.0%

Design an ETL pipeline that ingests data in its native format, performs minimal transformations, and stores it in a data lake for further processing and analysis.

Perform extensive data cleansing and transformation before ingesting the data into the data lake, to ensure data quality and consistency.

20.0%

Use a single-stage ETL process to load all data into the data lake and perform all transformations and analysis there, without considering data governance and security.

5.0%
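
The option that ingests data in its native format with minimal transformation can be illustrated with a small sketch. This is not an AWS implementation: a local directory stands in for the data lake's object store, and the function name `ingest_raw`, the `raw/<source_system>/ingest_date=YYYY-MM-DD/` layout, and the `.meta.json` sidecar file are all hypothetical choices made for illustration. In a real deployment the same idea would typically map to an object store such as Amazon S3 and a metadata catalog.

```python
import hashlib
import json
import shutil
from datetime import datetime, timezone
from pathlib import Path


def ingest_raw(source_file: Path, lake_root: Path, source_system: str) -> Path:
    """Copy one file into the raw zone unchanged and record basic metadata.

    Hypothetical helper: the directory layout and sidecar format are
    assumptions for this sketch, not a standard.
    """
    now = datetime.now(timezone.utc)

    # Partition by source system and ingestion date so later jobs can
    # select only the slices they need.
    target_dir = lake_root / "raw" / source_system / f"ingest_date={now:%Y-%m-%d}"
    target_dir.mkdir(parents=True, exist_ok=True)

    # Byte-for-byte copy: the file is never parsed, so CSV, JSON, images,
    # and logs are all handled the same way and keep their native format.
    target = target_dir / source_file.name
    shutil.copy2(source_file, target)

    # Minimal metadata captured at ingestion time supports governance
    # (lineage, integrity checks, discovery) without altering the data.
    metadata = {
        "source_system": source_system,
        "original_name": source_file.name,
        "size_bytes": target.stat().st_size,
        "sha256": hashlib.sha256(target.read_bytes()).hexdigest(),
        "ingested_at": now.isoformat(),
    }
    sidecar = target.with_name(target.name + ".meta.json")
    sidecar.write_text(json.dumps(metadata, indent=2))
    return target
```

Heavier cleansing and reshaping would then run as separate downstream jobs that read from the raw zone and write to curated zones, which keeps the original data available for reprocessing if transformation logic changes.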