
Answer-first summary for fast verification
Answer: Use Amazon S3 as a persistent data store., Use Graviton instances for core nodes and task nodes.
For long-running and cost-optimized workloads, using Amazon S3 as a persistent data store (EMRFS) is a best practice as it decouples compute from storage. Additionally, AWS Graviton instances provide a better price-performance ratio, making them the most cost-effective choice for computing without sacrificing performance compared to x86-based instances.
Author: Ritesh Yadav
Ultimate access to all questions.
Question 4
A company is planning to use a provisioned Amazon EMR cluster that runs Apache Spark jobs to perform big data analysis. The company requires high reliability. A big data team must follow best practices for running cost-optimized and long-running workloads on Amazon EMR. The team must find a solution that will maintain the company's current level of performance. Which combination of resources will meet these requirements MOST cost-effectively? (Select TWO.)
A
Use Hadoop Distributed File System (HDFS) as a persistent data store.
B
Use Amazon S3 as a persistent data store.
C
Use x86-based instances for core nodes and task nodes.
D
Use Graviton instances for core nodes and task nodes.
E
Use Spot Instances for all primary nodes.
No comments yet.