
Google Professional Machine Learning Engineer
You are tasked with developing a natural language processing (NLP) model for text classification on a dataset that includes millions of product descriptions and 100,000 unique words. The model will be implemented using a recurrent neural network (RNN). Given the scale of the dataset and the complexity of the task, which preprocessing method is most appropriate for preparing the words as inputs to the RNN? Consider the need for efficiency, scalability, and the ability to capture semantic relationships between words. Choose the best option from the following:
A. Represent each word as a one-hot encoded vector over the 100,000-word vocabulary.
B. Represent each word with a dense embedding from a pre-trained model such as Word2Vec or GloVe.
C. Map each word to an arbitrary numerical identifier.
D. Encode each word by its frequency of occurrence in the corpus.
E. Use pre-trained word embeddings followed by an additional dimensionality-reduction step.

Correct answer: B
Explanation:
Option B is the most appropriate. Pre-trained embeddings such as Word2Vec or GloVe map each word to a dense, low-dimensional vector (typically 100–300 dimensions) that captures semantic relationships learned from large corpora, so words with similar meanings receive similar vectors. This representation is efficient and scalable for a 100,000-word vocabulary and lets the RNN leverage knowledge learned outside the training set. Option E would also work, but the extra dimensionality-reduction step is usually unnecessary because pre-trained embeddings are already compact. One-hot encodings (Option A) produce 100,000-dimensional sparse vectors that are memory-inefficient and treat every pair of distinct words as equally unrelated. Numerical identifiers (Option C) impose an arbitrary ordering that carries no semantic meaning, and frequency-based encodings (Option D) conflate unrelated words that happen to occur with similar frequency, so neither is effective for this task.
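The contrast between Options A and B can be made concrete with a small sketch. The toy vocabulary and embedding vectors below are hypothetical stand-ins (real Word2Vec or GloVe vectors would be loaded from a published file); the point is that one-hot vectors are mutually orthogonal, so every word pair looks equally unrelated, while dense embeddings place semantically similar words close together:

```python
import numpy as np

# Hypothetical stand-ins for pre-trained vectors; real Word2Vec/GloVe
# embeddings would be loaded from a published file and have ~100-300 dims.
vocab = ["laptop", "notebook", "banana"]
pretrained = {
    "laptop":   np.array([0.9, 0.1, 0.0, 0.2]),
    "notebook": np.array([0.8, 0.2, 0.1, 0.1]),  # near "laptop"
    "banana":   np.array([0.0, 0.9, 0.8, 0.0]),  # semantically distant
}

word_to_id = {w: i for i, w in enumerate(vocab)}
# Embedding matrix: row i holds the dense vector for word ID i.
emb_matrix = np.stack([pretrained[w] for w in vocab])

def one_hot(word):
    # Option A: a sparse vector as wide as the vocabulary.
    v = np.zeros(len(vocab))
    v[word_to_id[word]] = 1.0
    return v

def embed(word):
    # Option B: the lookup an RNN's embedding layer performs per word ID.
    return emb_matrix[word_to_id[word]]

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# One-hot vectors are orthogonal: related and unrelated pairs both score 0.
assert cosine(one_hot("laptop"), one_hot("notebook")) == 0.0
# Dense embeddings rank the related pair above the unrelated one.
assert cosine(embed("laptop"), embed("notebook")) > cosine(embed("laptop"), embed("banana"))
print("embeddings capture similarity; one-hot does not")
```

At the scale in the question, the gap widens: one-hot input vectors would be 100,000-dimensional and almost entirely zeros, whereas a pre-trained embedding layer feeds the RNN a fixed, small dense vector per token.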