
Answer-first summary for fast verification
Answer: Decrease temperature
## Explanation **Temperature parameter** affects output creativity vs. factuality: - **Lower temperature (0.0-0.3)**: More focused, deterministic, and factual responses - **Higher temperature (0.7-1.0)**: More creative, imaginative, and potentially off-topic **Why A is correct**: - The problem describes "imaginative but off-topic answers" - classic sign of high temperature - Decreasing temperature makes the model more focused and factual - This reduces randomness and keeps responses more grounded in the actual query **Why others are incorrect**: - **B**: Increasing top-p would allow more diverse token selection, potentially making responses more creative - **C**: Raising max-tokens only affects response length, not content quality - **D**: Disabling stop-sequences could make responses run on without proper ending
Author: Ritesh Yadav
Ultimate access to all questions.
No comments yet.