
Answer-first summary for fast verification
Answer: Increase temperature to 0.9 and set top-p to 0.95
## Explanation **Correct Answer: A (Increase temperature to 0.9 and set top-p to 0.95)** ### Why Option A is Correct: 1. **Temperature (0.9)**: Increasing temperature to 0.9 makes the model's output more creative and diverse. Temperature controls the randomness of predictions - higher values (closer to 1.0) make the output more varied, while lower values (closer to 0) make it more deterministic and conservative. 2. **Top-p (0.95)**: Setting top-p (nucleus sampling) to 0.95 allows the model to consider a broader range of tokens while still maintaining coherence. Top-p sampling selects from the smallest set of tokens whose cumulative probability exceeds p, so 0.95 includes most of the probability mass while filtering out very unlikely tokens. 3. **Combination Effect**: This combination encourages: - More creative, detailed responses (from higher temperature) - Maintains coherence and reduces hallucinations (from appropriate top-p) - Balances creativity with reliability ### Why Other Options Are Incorrect: **B. Decrease top-p to 0.4**: This would make the model MORE conservative, as it restricts the token selection to only the most probable tokens, resulting in even shorter, safer responses. **C. Set temperature to 0.2**: This would make the model MORE deterministic and conservative, producing even shorter and more predictable responses. **D. Increase top-k to 200**: While increasing top-k expands the number of tokens considered, it doesn't directly address the creativity-conservativeness balance as effectively as adjusting temperature and top-p together. ### Key Concepts: - **Temperature**: Controls randomness (0 = deterministic, 1 = creative) - **Top-p (nucleus sampling)**: Controls token selection based on cumulative probability - **Top-k**: Controls the number of top tokens considered For generating richer, more detailed legal document responses without hallucinations, the optimal approach is to increase creativity (temperature) while maintaining reasonable constraints (top-p) to prevent randomness.
Author: Ritesh Yadav
Ultimate access to all questions.
No comments yet.
A legal document assistant LLM is producing responses that are too short and overly conservative. The team wants the model to generate richer, more detailed answers without becoming random or hallucinating. Which adjustment best meets this goal?
A
Increase temperature to 0.9 and set top-p to 0.95
B
Decrease top-p to 0.4
C
Set temperature to 0.2
D
Increase top-k to 200