Explanation
Correct Answer: A (Increase Top-P to 0.95)
Why this is correct:
- Top-P (Nucleus Sampling) controls the diversity of word choices by limiting the vocabulary to the smallest set of words whose cumulative probability exceeds P.
- Higher Top-P values (closer to 1.0) allow the model to consider a wider range of possible words, increasing vocabulary diversity.
- Top-P = 0.95 means the model will consider words that make up 95% of the probability mass, allowing for more diverse and creative word choices.
Why the other options are incorrect:
- B (Decrease temperature to 0.2): Lower temperature values make the model more deterministic and less creative. Temperature controls randomness - lower values make the model choose the most probable words, reducing diversity.
- C (Reduce max-tokens to 100): This only limits the length of the output, not the diversity of word choices within that output.
- D (Add stop-sequence after each sentence): This controls when generation stops, not the diversity of vocabulary used.
Key Concepts:
- Top-P (Nucleus Sampling): Higher values = more diverse vocabulary
- Temperature: Higher values = more random/creative, lower values = more deterministic
- For creative writing and style variety, increasing Top-P is the most direct way to encourage diverse word choices.