You need to train a natural language model to perform text classification on a dataset of millions of product descriptions with a vocabulary of 100,000 unique words. Which preprocessing step should you apply to the words so that they can be fed into a recurrent neural network (RNN) and give the best performance?
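For orientation, one preprocessing approach commonly considered in this kind of scenario is to map each word to an integer id and then look up a dense embedding vector for it, so the RNN does not receive 100,000-dimensional one-hot inputs. The question itself does not state this as the answer, so the sketch below is only an illustration under stated assumptions: the helper names (`build_vocab`, `encode`, `make_embedding_table`, `embed`), the toy descriptions, the embedding size of 8, and the random table standing in for pretrained embeddings are all hypothetical.

```python
from collections import Counter
import random

UNK_ID = 0  # id reserved for out-of-vocabulary words (an assumption of this sketch)

def build_vocab(texts, max_size):
    """Assign integer ids 1..max_size to the most frequent lowercase tokens."""
    counts = Counter(word for text in texts for word in text.lower().split())
    return {word: i + 1 for i, (word, _) in enumerate(counts.most_common(max_size))}

def encode(text, vocab):
    """Turn a description into a list of integer ids, using UNK_ID for unseen words."""
    return [vocab.get(word, UNK_ID) for word in text.lower().split()]

def make_embedding_table(vocab_size, dim, seed=0):
    """Random stand-in for a pretrained embedding matrix with vocab_size + 1 rows."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(vocab_size + 1)]

def embed(ids, table):
    """Look up one dense vector per token id; an RNN would consume this sequence."""
    return [table[i] for i in ids]

descriptions = ["red cotton shirt", "blue cotton jeans"]  # toy data, not from the question
vocab = build_vocab(descriptions, max_size=100_000)
table = make_embedding_table(len(vocab), dim=8)
sequence = embed(encode("red cotton hat", vocab), table)
```

Here `sequence` holds one 8-dimensional vector per word, and the unseen word "hat" falls back to the row for `UNK_ID`. In a real pipeline the random table would typically be replaced by vectors loaded from a pretrained embedding model or learned by an embedding layer during training.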