
Ultimate access to all questions.
You are tasked with preparing 6 TB of home sales data for machine learning using SQL and BigQuery ML in a large real estate firm. The goal is to use the resulting model for predictions against a raw dataset. To ensure predictions are not skewed, which workflow should you adopt?
A
Transform your raw input data using a saved query before requesting predictions and then use ML.EVALUATE, after defining preprocessing steps with BigQuery‘s TRANSFORM clause during model creation.
B
Define your preprocessing logic using a BigQuery view for model training data. At prediction time, use BigQuery‘s ML.EVALUATE clause without any transformations on the raw input data.
C
Preprocess all data using Dataflow. At prediction time, use BigQuery‘s ML.EVALUATE clause without any further transformations on the input data.
D
Use BigQuery‘s TRANSFORM clause to define preprocessing steps when creating your model. Then, at prediction time, use BigQuery‘s ML.EVALUATE clause without specifying any transformations on the raw input data.