
Answer-first summary for fast verification
Answer: Create a multi-stage ETL pipeline with intermediate data staging locations to handle the different data types and sources, and perform transformations at each stage.
Option B is correct. A multi-stage ETL pipeline with intermediate data staging locations is needed to handle the mix of structured and unstructured data arriving from multiple sources at high velocity and volume. Staging locations let each stage persist its output, so data can be validated, transformed, and reprocessed incrementally, which makes the pipeline easier to manage, debug, and optimize. The alternatives fall short: a single-stage process (option A) pushes all transformation work into the data lake with no checkpoints, ignoring unstructured data (option C) discards the text needed for sentiment analysis, and traditional batch processing (option D) cannot keep pace with the data's velocity and volume.
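The multi-stage design can be illustrated with a minimal sketch. This is a simplified, hypothetical example (not a production implementation): temporary directories stand in for the intermediate staging locations, and the extract, transform, and load stages each read from and write to a staging area. The function names, file layout, and hashtag-based "analysis" are all illustrative assumptions.

```python
import json
import re
from pathlib import Path
from tempfile import TemporaryDirectory

def extract(posts, raw_dir: Path):
    """Stage 1: land raw posts (structured fields plus unstructured text)
    unchanged into the raw staging location."""
    path = raw_dir / "posts.jsonl"
    with path.open("w") as f:
        for post in posts:
            f.write(json.dumps(post) + "\n")
    return path

def transform(raw_path: Path, clean_dir: Path):
    """Stage 2: derive structured fields from the unstructured text and
    write the result to the cleaned staging location."""
    out = clean_dir / "posts_clean.jsonl"
    with raw_path.open() as src, out.open("w") as dst:
        for line in src:
            post = json.loads(line)
            text = post.get("text", "")
            post["hashtags"] = re.findall(r"#(\w+)", text)
            post["word_count"] = len(text.split())
            dst.write(json.dumps(post) + "\n")
    return out

def load(clean_path: Path):
    """Stage 3: aggregate the cleaned records for trend analysis."""
    tag_counts = {}
    with clean_path.open() as f:
        for line in f:
            for tag in json.loads(line)["hashtags"]:
                tag_counts[tag] = tag_counts.get(tag, 0) + 1
    return tag_counts

with TemporaryDirectory() as tmp:
    # Two intermediate staging locations: raw landing zone and cleaned zone.
    raw_dir = Path(tmp) / "staging" / "raw"
    clean_dir = Path(tmp) / "staging" / "clean"
    raw_dir.mkdir(parents=True)
    clean_dir.mkdir(parents=True)

    posts = [
        {"id": 1, "text": "Loving the new release! #launch #tech"},
        {"id": 2, "text": "Mixed feelings about #launch today"},
    ]
    raw = extract(posts, raw_dir)
    clean = transform(raw, clean_dir)
    counts = load(clean)
    print(counts)  # {'launch': 2, 'tech': 1}
```

Because each stage's output is persisted before the next stage runs, a failed or updated transformation can be rerun from the staging data rather than re-extracting from the source; at production scale the same structure maps onto a distributed framework such as Apache Spark with object storage as the staging layer.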
Author: LeetQuiz Editorial Team
You are working on a data processing project that involves analyzing social media posts to identify trends and sentiment. The data includes a mix of structured and unstructured data, with high velocity and volume. Describe how you would design an ETL pipeline to handle this data, and explain the role of intermediate data staging locations in the pipeline.
A
Use a single-stage ETL process to load the data directly into a data lake and perform all transformations and analysis there.
B
Create a multi-stage ETL pipeline with intermediate data staging locations to handle the different data types and sources, and perform transformations at each stage.
C
Only process structured data and ignore unstructured data due to the complexity of handling different data types.
D
Use a traditional batch processing approach to handle the data, as it is more cost-effective than using a distributed computing framework like Apache Spark.