
Ultimate access to all questions.
When developing a linear regression model in BigQuery ML to predict a customer's likelihood of purchasing your company's products, city names are a key predictive component. However, the data must be organized into columns for both training and serving the model. What is the most efficient method to prepare this data?
A
Use Cloud Data Fusion to assign a number to each city based on its region and represent it with that number in the model
B
Create a new view in BigQuery that excludes the city column
C
Use SQL in BigQuery to apply one-hot encoding to the state column and convert each city to a binary value column
D
Use TensorFlow to generate a categorical variable with a vocabulary list and a vocabulary file that can be uploaded to BigQuery ML