
Answer-first summary for fast verification
Answer: Identify word embeddings from a pre-trained model, and use the embeddings in your model.
The correct answer is B. Identifying word embeddings from a pre-trained model and using these embeddings in your model is the recommended approach. Word embeddings provide a dense vector representation of words, capturing their semantic meanings and relationships to other words, which recurrent neural networks (RNNs) can effectively utilize to understand context. This method is more efficient and effective compared to one-hot encoding or manual frequency sorting.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
You need to train a natural language model to perform text classification on a dataset containing millions of product descriptions, with a vocabulary size of 100,000 unique words. To effectively preprocess the words so they can be fed into a recurrent neural network (RNN) for better performance, what preprocessing step should you take?
A
Create a hot-encoding of words, and feed the encodings into your model.
B
Identify word embeddings from a pre-trained model, and use the embeddings in your model.
C
Sort the words by frequency of occurrence, and use the frequencies as the encodings in your model.
D
Assign a numerical value to each word from 1 to 100,000 and feed the values as inputs in your model.