In a machine learning project that must process a vast amount of data spread across multiple nodes, which Spark component is optimized for efficient distributed computing and for handling large-scale datasets?
A. Spark SQL
B. Spark Streaming
C. Spark MLlib
D. Spark DataFrame