
Answer-first summary for fast verification
Answer: Check if the number of bins is equal to or greater than the number of different category values in a column
To verify the sufficiency of bins for numerical features in a Databricks Decision Tree, it's essential to ensure the number of bins is at least equal to the number of distinct category values in the column. This can be achieved by executing a group by operation on the column to tally the unique category values. - **Option A** is not directly relevant as it assesses overall model performance rather than bin sufficiency. - **Option B** is incorrect because the power of 2 is not a standard or meaningful metric for determining bin adequacy. - **Option C** is also not directly relevant, as overfitting pertains to model complexity and training data fit, not specifically to bin count.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
How can you ensure the number of bins for numerical features in a Databricks Decision Tree is adequate?
A
Check if the model performance is satisfactory
B
Check if the number of bins is a power of 2
C
Check if the model is overfitting
D
Check if the number of bins is equal to or greater than the number of different category values in a column
No comments yet.