In a scenario where you are using AutoML to develop a predictive model for a large-scale e-commerce platform, describe the steps you would take to ensure the data used for training is of high quality. Discuss the potential challenges and how you would address them.