
A data scientist has developed a two-class decision tree classifier using Spark ML and computed the predictions in a Spark DataFrame preds_df with the schema: prediction DOUBLE, actual DOUBLE. Which of the following code blocks correctly computes the model's accuracy from preds_df and assigns it to the accuracy variable?
A
accuracy = RegressionEvaluator(predictionCol="prediction", labelCol="actual", metricName="accuracy")
B
classification_evaluator = MulticlassClassificationEvaluator(predictionCol="prediction", labelCol="actual", metricName="accuracy")
accuracy = classification_evaluator.evaluate(preds_df)
C
classification_evaluator = BinaryClassificationEvaluator(predictionCol="prediction", labelCol="actual", metricName="accuracy")
D
accuracy = Summarizer(predictionCol="prediction", labelCol="actual", metricName="accuracy")
E
classification_evaluator = BinaryClassificationEvaluator(rawPredictionCol="prediction", labelCol="actual", metricName="accuracy")
accuracy = classification_evaluator.evaluate(preds_df)
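
For reference, the quantity the question asks for can be sketched in plain Python, without Spark. Accuracy here is taken to mean the fraction of rows whose prediction equals the actual label. The sample rows and the helper name accuracy_from_rows are hypothetical stand-ins for preds_df, not part of the original question.

```python
# Hypothetical rows standing in for preds_df: (prediction, actual) pairs.
rows = [(1.0, 1.0), (0.0, 1.0), (0.0, 0.0), (1.0, 1.0)]


def accuracy_from_rows(rows):
    """Return the fraction of rows where prediction equals actual."""
    if not rows:
        raise ValueError("cannot compute accuracy of an empty prediction set")
    # Count the rows where the predicted class matches the true class.
    correct = sum(1 for prediction, actual in rows if prediction == actual)
    return correct / len(rows)


accuracy = accuracy_from_rows(rows)
```

An evaluator-based answer should yield this same ratio on the same rows, which gives a quick sanity check on a small sample.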