
Explanation:
The correct answer is D) Hyperopt selects parallelism when execution begins, and autoscaling may affect this configuration. Here's why:
Hyperopt's Parallelism Selection: At the start of a Hyperopt run with SparkTrials, the parallelism level is determined based on the current number of Spark executors. This setting usually stays the same throughout the tuning process.
Autoscaling Cluster Dynamics: Autoscaling clusters adjust the number of executors dynamically based on workload, which means the executor count can change during a Hyperopt run.
Potential Issue: Autoscaling after Hyperopt has set the parallelism level can lead to inefficiencies:
Recommendations:
Key Points:
Ultimate access to all questions.
What is the reason SparkTrials should not be used on autoscaling clusters, and what potential issue can this lead to?
A
Hyperopt cannot select the parallelism value on autoscaling clusters
B
Autoscaling clusters do not support SparkTrials configuration
C
SparkTrials is incompatible with autoscaling clusters
D
Hyperopt selects parallelism when execution begins, and autoscaling may affect this configuration
No comments yet.