Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
In a network-intensive Spark application, which serialization format is most efficient for minimizing data transfer overhead and maximizing performance?
A
JSON serialization for its ease of use and compatibility with data processing pipelines.
B
Kryo serialization for its efficiency and speed over Java serialization.
C
Java serialization due to its native integration with Spark.
D
Parquet columnar format for serializing individual RDDs.