AWS Certified Data Engineer - Associate

Get started today

Ultimate access to all questions.

Explanation:

To use Athena notebooks with Apache Spark, you would need to set up an Amazon EMR cluster, which is a managed cluster platform that simplifies running big data frameworks like Apache Spark. Configuring Spark on EMR allows you to leverage the capabilities of Spark for data processing and analysis. Using Jupyter notebooks on EMR provides an interactive environment for exploring the dataset, making it easier to perform complex data manipulations and analyses.

Explanation:

Comments (0)

No comments yet.

Consider a scenario where you need to explore a large dataset stored in S3 using Athena notebooks that use Apache Spark. What are the key considerations and steps you would take to set up and use these notebooks effectively?

Simulated

Last updated: February 3, 2026 at 14:03

Set up an EMR cluster, configure Spark, and use Jupyter notebooks on EMR.

47.6%

Use AWS Glue to transform the data, and then use Athena notebooks.

23.8%

Set up an AWS Glue job, configure Spark, and use Athena notebooks.

14.3%

Use Amazon SageMaker notebooks with pre-configured Spark environments.

14.3%