Ultimate access to all questions.
Upgrade Now 🚀
Sign in to unlock AI tutor
To optimize the performance of Spark jobs in a Databricks environment by reducing data shuffling during transformations, what strategy should a data engineering team consider?
A
Enable automatic optimization in the Databricks cluster settings.
B
Use the repartition method to control the number of partitions.
C
Increase the number of partitions in the DataFrame.
D
Decrease the number of worker nodes in the Spark cluster.