
Explanation:
The question requires a storage level that maximizes in-memory storage on two nodes while spilling excess data to disk. MEMORY_AND_DISK_2 (option D) uses memory and disk (spilling when memory is full) and replicates data across two nodes. This ensures partitions are stored in memory on both nodes if possible, or on disk if memory is insufficient. Other options either lack replication (B, C, E) or do not spill to disk (A, E).
Ultimate access to all questions.
Which of the following storage levels should be used to maximize in-memory storage across two cluster nodes while spilling excess data to disk for on-demand retrieval?
A
MEMORY_ONLY_2
B
MEMORY_AND_DISK_SER
C
MEMORY_AND_DISK
D
MEMORY_AND_DISK_2
E
MEMORY_ONLY