You are tasked with developing a machine learning pipeline for training an XGBoost classification model using tabular data stored in a BigQuery table. Your goal is to ensure that the pipeline effectively handles data splitting, feature engineering, and model evaluation, and allows for easy comparison of different models. The required steps are: 1. Randomly split the data into training and evaluation datasets in a 65/35 ratio. 2. Conduct feature engineering to prepare the data for training. 3. Obtain evaluation metrics for the model's performance. 4. Compare the performance of models trained in different pipeline executions. Which approach should you take to achieve these requirements? | Google Professional Machine Learning Engineer Quiz - LeetQuiz

You are tasked with developing a machine learning pipeline for training an XGBoost classification model using tabular data stored in a BigQuery table. Your goal is to ensure that the pipeline effectively handles data splitting, feature engineering, and model evaluation, and allows for easy comparison of different models. The required steps are: 1. Randomly split the data into training and evaluation datasets in a 65/35 ratio. 2. Conduct feature engineering to prepare the data for training. 3. Obtain evaluation metrics for the model's performance. 4. Compare the performance of models trained in different pipeline executions. Which approach should you take to achieve these requirements?

Exam-Like

A

Using Vertex AI Pipelines, add a component to divide the data into training and evaluation sets, and add another component for feature engineering. 2. Enable autologging of metrics in the training component. 3. Compare pipeline runs in Vertex AI Experiments.

53.0%

B

Using Vertex AI Pipelines, add a component to divide the data into training and evaluation sets, and add another component for feature engineering. 2. Enable autologging of metrics in the training component. 3. Compare models using the artifacts’ lineage in Vertex ML Metadata.

12.0%

C

In BigQuery ML, use the CREATE MODEL statement with BOOSTED_TREE_CLASSIFIER as the model type and use BigQuery to handle the data splits. 2. Use a SQL view to apply feature engineering and train the model using the data in that view. 3. Compare the evaluation metrics of the models by using a SQL query with the ML.TRAINING_INFO statement._

15.4%

D

In BigQuery ML, use the CREATE MODEL statement with BOOSTED_TREE_CLASSIFIER as the model type and use BigQuery to handle the data splits. 2. Use ML TRANSFORM to specify the feature engineering transformations and train the model using the data in the table. 3. Compare the evaluation metrics of the models by using a SQL query with the ML.TRAINING_INFO statement._

19.7%

Powered ByGPT-5.2