When you use a custom partitioner on a Spark RDD to optimize data distribution across nodes in a Microsoft Azure Synapse Analytics environment, which of the following factors is the least important?
A. The hash function used for partitioning
B. The size of each partition
C. The number of partitions
D. Network bandwidth between Spark and Azure Synapse
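
To make the partitioning terms in the options concrete, here is a minimal sketch in plain Python (no Spark required) of how a hash function and a partition count together determine partition sizes. The names `stable_hash`, `partition_sizes`, and `skew`, along with the sample keys, are hypothetical and chosen only for illustration. In PySpark, a function of this kind could be passed as the `partitionFunc` argument of `RDD.partitionBy(numPartitions, partitionFunc)`.

```python
from collections import Counter
import zlib


def stable_hash(key: str) -> int:
    """Deterministic hash of a string key (CRC32), used here as an example."""
    return zlib.crc32(key.encode("utf-8"))


def partition_sizes(keys, num_partitions, hash_fn=stable_hash):
    """Count how many keys land in each partition under hash_fn % num_partitions."""
    counts = Counter(hash_fn(k) % num_partitions for k in keys)
    return [counts.get(p, 0) for p in range(num_partitions)]


def skew(sizes):
    """Ratio of the largest partition to the mean partition size (1.0 = perfectly even)."""
    mean = sum(sizes) / len(sizes)
    return max(sizes) / mean if mean else 0.0


# Hypothetical sample keys, assumed only for this illustration.
keys = [f"customer-{i}" for i in range(10_000)]

# A hash that mixes the whole key spreads records across all partitions.
good = partition_sizes(keys, 8)

# A poor hash (key length) sends most records to a single partition.
bad = partition_sizes(keys, 8, hash_fn=lambda k: len(k))

print("good sizes:", good, "skew:", round(skew(good), 2))
print("bad sizes: ", bad, "skew:", round(skew(bad), 2))
```

With the key-length hash, 9,000 of the 10,000 keys fall into one of the eight partitions, so one task would do most of the work while the others sit nearly idle. Changing the partition count or the hash function changes the resulting sizes, which is why those three factors are usually tuned together.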