Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
What is a recommended approach to improve the performance of a disk I/O intensive Hadoop job that's running slowly on Cloud Dataproc, especially when using the Cloud Storage connector for data storage?
A
Enhance the virtual machine instances' CPU cores to boost networking bandwidth for each instance
B
Increase the Hadoop cluster's memory allocation to keep the intermediary data of the slow Hadoop job in memory
C
Assign additional network interface cards (NICs) and configure link aggregation in the OS to utilize combined throughput with Cloud Storage
D
Provide adequate persistent disk space to the Hadoop cluster and store the intermediate data on native Hadoop Distributed File System (HDFS)