
Ultimate access to all questions.
Deep dive into the quiz with AI chat providers.
We prepare a focused prompt with your quiz and certificate details so each AI can offer a more tailored, in-depth explanation.
A legal document assistant LLM is producing responses that are too short and overly conservative. The team wants the model to generate richer, more detailed answers without becoming random or hallucinating. Which adjustment best meets this goal?
A
Increase temperature to 0.9 and set top-p to 0.95
B
Decrease top-p to 0.4
C
Set temperature to 0.2
D
Increase top-k to 200
Explanation:
Correct Answer: A (Increase temperature to 0.9 and set top-p to 0.95)
Temperature (0.9): Increasing temperature to 0.9 makes the model's output more creative and diverse. Temperature controls the randomness of predictions - higher values (closer to 1.0) make the output more varied, while lower values (closer to 0) make it more deterministic and conservative.
Top-p (0.95): Setting top-p (nucleus sampling) to 0.95 allows the model to consider a broader range of tokens while still maintaining coherence. Top-p sampling selects from the smallest set of tokens whose cumulative probability exceeds p, so 0.95 includes most of the probability mass while filtering out very unlikely tokens.
Combination Effect: This combination encourages:
More creative, detailed responses (from higher temperature)
Maintains coherence and reduces hallucinations (from appropriate top-p)
Balances creativity with reliability
B. Decrease top-p to 0.4: This would make the model MORE conservative, as it restricts the token selection to only the most probable tokens, resulting in even shorter, safer responses.
C. Set temperature to 0.2: This would make the model MORE deterministic and conservative, producing even shorter and more predictable responses.
D. Increase top-k to 200: While increasing top-k expands the number of tokens considered, it doesn't directly address the creativity-conservativeness balance as effectively as adjusting temperature and top-p together.
Temperature: Controls randomness (0 = deterministic, 1 = creative)
Top-p (nucleus sampling): Controls token selection based on cumulative probability
Top-k: Controls the number of top tokens considered
For generating richer, more detailed legal document responses without hallucinations, the optimal approach is to increase creativity (temperature) while maintaining reasonable constraints (top-p) to prevent randomness.