Google Professional Machine Learning Engineer

Data transformation is a key step in preparing data for machine learning models. A team is working on a project to predict customer churn for a telecom company. The dataset includes customer demographics, service usage, and complaint history. The raw data is messy: it has missing values, inconsistent formats, and categorical variables that cannot be fed directly into machine learning algorithms. The team needs to preprocess this data to make it suitable for analysis. Which of the following best describes the process of 'data transformation' in this scenario? Choose the best option.

Expanding the dataset by integrating additional data sources such as social media activity to enhance predictive accuracy.

0.0%

The process of gathering raw data from various internal and external sources to compile a comprehensive dataset.

Manipulating and converting raw data into a format that's ready for analysis, including handling missing values, normalizing numerical data, and encoding categorical variables.

100.0%
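
The steps named in the last option (handling missing values, normalizing numerical data, encoding categorical variables) can be sketched in a few lines. This is a minimal, dependency-free illustration, not a recommended production pipeline: the records, the column names (`age`, `monthly_minutes`, `contract`), and the `transform` helper are all hypothetical, and median imputation, min-max scaling, and one-hot encoding are just one reasonable choice for each step.

```python
from statistics import median

# Hypothetical toy records; the column names are illustrative only.
raw = [
    {"age": 34, "monthly_minutes": 420.0, "contract": "Monthly"},
    {"age": None, "monthly_minutes": 120.0, "contract": "yearly "},
    {"age": 51, "monthly_minutes": None, "contract": "monthly"},
    {"age": 27, "monthly_minutes": 900.0, "contract": "Yearly"},
]

NUMERIC = ["age", "monthly_minutes"]
CATEGORICAL = ["contract"]


def transform(rows):
    """Clean formats, impute, scale, and one-hot encode a list of dict records."""
    rows = [dict(r) for r in rows]  # copy so the raw data is left untouched

    # 1. Fix inconsistent formats: trim whitespace and lowercase category labels.
    for r in rows:
        for col in CATEGORICAL:
            r[col] = r[col].strip().lower()

    # 2. Handle missing values: fill numeric gaps with the column median.
    for col in NUMERIC:
        fill = median(r[col] for r in rows if r[col] is not None)
        for r in rows:
            if r[col] is None:
                r[col] = fill

    # 3. Normalize numeric columns to the range [0, 1] (min-max scaling).
    for col in NUMERIC:
        lo = min(r[col] for r in rows)
        hi = max(r[col] for r in rows)
        span = (hi - lo) or 1.0  # avoid division by zero for constant columns
        for r in rows:
            r[col] = (r[col] - lo) / span

    # 4. Encode categorical columns as one 0/1 indicator per level.
    for col in CATEGORICAL:
        levels = sorted({r[col] for r in rows})
        for r in rows:
            value = r.pop(col)
            for level in levels:
                r[f"{col}_{level}"] = 1 if value == level else 0

    return rows


clean = transform(raw)
for row in clean:
    print(row)
```

In a real project the fill values, scaling ranges, and category levels would normally be computed on the training split only and then reused on validation and test data, typically through a library such as pandas or scikit-learn rather than hand-written loops.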