Ultimate access to all questions.
In the context of Apache Spark, explain how decision trees are scaled and parallelized. Discuss the specific mechanisms Spark uses to distribute the training of decision trees, including the handling of feature selection, node splitting, and the aggregation of results across multiple nodes.