AWS Certified Cloud Practitioner

Get started today

Ultimate access to all questions.

Explanation:

Explanation

Correct Answer: A (Increase temperature to 0.9 and set top-p to 0.95)

Why Option A is Correct:

Temperature (0.9): Increasing temperature to 0.9 makes the model's output more creative and diverse. Temperature controls the randomness of predictions - higher values (closer to 1.0) make the output more varied, while lower values (closer to 0) make it more deterministic and conservative.
Top-p (0.95): Setting top-p (nucleus sampling) to 0.95 allows the model to consider a broader range of tokens while still maintaining coherence. Top-p sampling selects from the smallest set of tokens whose cumulative probability exceeds p, so 0.95 includes most of the probability mass while filtering out very unlikely tokens.
Combination Effect: This combination encourages:
- More creative, detailed responses (from higher temperature)
- Maintains coherence and reduces hallucinations (from appropriate top-p)
- Balances creativity with reliability

Why Other Options Are Incorrect:

B. Decrease top-p to 0.4: This would make the model MORE conservative, as it restricts the token selection to only the most probable tokens, resulting in even shorter, safer responses.

C. Set temperature to 0.2: This would make the model MORE deterministic and conservative, producing even shorter and more predictable responses.

D. Increase top-k to 200: While increasing top-k expands the number of tokens considered, it doesn't directly address the creativity-conservativeness balance as effectively as adjusting temperature and top-p together.

Key Concepts:

Temperature: Controls randomness (0 = deterministic, 1 = creative)
Top-p (nucleus sampling): Controls token selection based on cumulative probability
Top-k: Controls the number of top tokens considered

For generating richer, more detailed legal document responses without hallucinations, the optimal approach is to increase creativity (temperature) while maintaining reasonable constraints (top-p) to prevent randomness.

Explanation:

Explanation

Correct Answer: A (Increase temperature to 0.9 and set top-p to 0.95)

Why Option A is Correct:

Temperature (0.9): Increasing temperature to 0.9 makes the model's output more creative and diverse. Temperature controls the randomness of predictions - higher values (closer to 1.0) make the output more varied, while lower values (closer to 0) make it more deterministic and conservative.
Top-p (0.95): Setting top-p (nucleus sampling) to 0.95 allows the model to consider a broader range of tokens while still maintaining coherence. Top-p sampling selects from the smallest set of tokens whose cumulative probability exceeds p, so 0.95 includes most of the probability mass while filtering out very unlikely tokens.
Combination Effect: This combination encourages:
- More creative, detailed responses (from higher temperature)
- Maintains coherence and reduces hallucinations (from appropriate top-p)
- Balances creativity with reliability

Why Other Options Are Incorrect:

B. Decrease top-p to 0.4: This would make the model MORE conservative, as it restricts the token selection to only the most probable tokens, resulting in even shorter, safer responses.

C. Set temperature to 0.2: This would make the model MORE deterministic and conservative, producing even shorter and more predictable responses.

Key Concepts:

Temperature: Controls randomness (0 = deterministic, 1 = creative)
Top-p (nucleus sampling): Controls token selection based on cumulative probability
Top-k: Controls the number of top tokens considered

Comments (0)

No comments yet.

A legal document assistant LLM is producing responses that are too short and overly conservative. The team wants the model to generate richer, more detailed answers without becoming random or hallucinating. Which adjustment best meets this goal?

Real Exam

Community

RRitesh

Last updated: January 10, 2026 at 14:02

Increase temperature to 0.9 and set top-p to 0.95

35.7%

Decrease top-p to 0.4

28.6%