Google Professional Data Engineer

You are developing a linear regression model in BigQuery ML to predict a customer's likelihood of purchasing your company's products. The city name is a key predictive feature, but the data must be organized into columns for both training and serving the model. What is the most efficient way to prepare this data?

Use Cloud Data Fusion to assign each city a number based on its region, and use that number to represent the city in the model

25.0%

Create a new view in BigQuery that excludes the city column

0.0%

Use TensorFlow to generate a categorical variable with a vocabulary list and a vocabulary file that can be uploaded to BigQuery ML

18.8%