Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
When optimizing serialization for complex nested data structures in a Spark application, which serialization library or format is most effective for minimizing memory usage and maximizing deserialization speed?
A
Parquet, leveraging its columnar storage format for efficient partial deserialization.
B
Avro, due to its compact binary format and schema evolution capabilities.
C
Java serialization due to its automatic handling of complex types.
D
Kryo serialization with custom serializers for nested structures.