
Ultimate access to all questions.
A data scientist developed two regression models: one using 'price' as the label variable and another using 'log(price)'. Upon evaluating both models using RMSE by comparing predicted values to actual prices, the RMSE for the second model is significantly higher than the first. What is the most valid explanation for this discrepancy? Choose the single best answer.
A
The second model is significantly more accurate than the first.
B
The data scientist did not apply the log transformation to the predictions of the first model before calculating RMSE.
C
The data scientist neglected to exponentiate the predictions of the second model to revert them to the original scale before RMSE calculation.
D
RMSE is not a suitable metric for evaluating regression models.
E
The first model is considerably more accurate than the second.