
Ultimate access to all questions.
Which of the following best describes the differences among the training, validation, and test data sub-samples, and how each is used?
A
Training data is used to evaluate the model's performance during the training process, validation data is used to build and finetune the model, and test data is used to identify issues or biases in the model.
B
Training data is used to build and finetune the model, validation data is used to evaluate the model's performance during the training process, and test data is used to evaluate the final performance of a ML model.
C
Training data is used to evaluate the final performance of a ML model, validation data is used to build and finetune the model, and test data is used to compare the model's performance to other models.
D
Training data is used to identify issues or biases in the model, validation data is used to evaluate the model's performance during the training process, and test data is used to build and finetune the model.
Explanation:
In machine learning model development, the data is divided into three distinct sub-samples with specific purposes:
This three-way split ensures proper model development, prevents overfitting, and provides reliable performance assessment.