
Ultimate access to all questions.
In a scenario where you are using AutoML to develop a predictive model for a large-scale e-commerce platform, describe the steps you would take to ensure the data used for training is of high quality. Discuss the potential challenges and how you would address them.
A
Perform data cleaning, handle missing values, remove outliers, and ensure data consistency.
B
Collect more data, merge data from different sources, normalize data, and perform feature selection.
C
Validate data schema, check for data duplication, standardize data formats, and perform data augmentation.
D
Audit data sources, verify data integrity, apply data transformations, and perform data balancing.