
Answer-first summary for fast verification
Answer: When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any transformations on the raw input data.
**Explanation:** Option A is the correct answer because:
- **TRANSFORM clause** in BigQuery ML automatically applies the same preprocessing during training and prediction
- **Prevents data skew** by ensuring consistent transformations across training and inference
- **Simplified workflow** - no need for separate transformation steps at prediction time
- **Native BigQuery ML feature** designed specifically for this use case

**Reference:** https://cloud.google.com/bigquery-ml/docs/bigqueryml-transform

Using the TRANSFORM clause, you can specify all preprocessing during model creation. The preprocessing is automatically applied during the prediction and evaluation phases of machine learning. Other options would require manual transformation steps that could introduce inconsistencies and data skew.
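The workflow described above can be sketched as two SQL statements, built here as Python strings so they are easy to inspect. This is a minimal illustration, not the exam's reference solution: the dataset, table, model, and column names (`mydataset.home_price_model`, `square_feet`, `year_built`, `sale_price`, and so on) are hypothetical, and the particular preprocessing functions are only examples of what might go inside a TRANSFORM clause.

```python
# Sketch only: dataset, table, model, and column names are hypothetical.


def create_model_sql(model: str, training_table: str) -> str:
    """Build a CREATE MODEL statement that keeps preprocessing in TRANSFORM."""
    return f"""
CREATE OR REPLACE MODEL `{model}`
TRANSFORM(
  -- Preprocessing defined here is stored with the model.
  ML.STANDARD_SCALER(square_feet) OVER () AS square_feet_scaled,
  ML.QUANTILE_BUCKETIZE(year_built, 5) OVER () AS year_built_bucket,
  sale_price  -- the label column is passed through unchanged
)
OPTIONS(model_type = 'linear_reg', input_label_cols = ['sale_price'])
AS
SELECT square_feet, year_built, sale_price
FROM `{training_table}`
""".strip()


def evaluate_sql(model: str, raw_table: str) -> str:
    """Build an ML.EVALUATE query that selects raw, untransformed columns."""
    return f"""
SELECT *
FROM ML.EVALUATE(
  MODEL `{model}`,
  (SELECT square_feet, year_built, sale_price FROM `{raw_table}`)
)
""".strip()


if __name__ == "__main__":
    print(create_model_sql("mydataset.home_price_model",
                           "mydataset.home_sales_raw"))
    print()
    print(evaluate_sql("mydataset.home_price_model",
                       "mydataset.new_home_sales_raw"))
```

The second query never repeats the scaling or bucketizing logic, which is the point of option A: the model applies its stored TRANSFORM to the raw columns. `ML.PREDICT` accepts raw input columns in the same way.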
Author: LeetQuiz
NO.11 You work for a large real estate firm and are preparing 6 TB of home sales data to be used for machine learning. You will use SQL to transform the data and use BigQuery ML to create a machine learning model. You plan to use the model for predictions against a raw dataset that has not been transformed. How should you set up your workflow in order to prevent skew at prediction time?
A
When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any transformations on the raw input data.
B
When creating your model, use BigQuery's TRANSFORM clause to define preprocessing steps. Before requesting predictions, use a saved query to transform your raw input data, and then use ML.EVALUATE.
C
Use a BigQuery view to define your preprocessing logic. When creating your model, use the view as your model training data. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any transformations on the raw input data.
D
Preprocess all data using Dataflow. At prediction time, use BigQuery's ML.EVALUATE clause without specifying any further transformations on the input data.