AWS Certified Generative AI Developer - Professional

Get started today

Ultimate access to all questions.

Explanation:

Explanation

Why Option B is correct:

Adaptive retry strategy with exponential backoff and jitter: This addresses transient service errors during peak usage periods by gradually increasing wait times between retries, preventing overwhelming Amazon Bedrock with immediate retries.
Circuit breaker pattern: This adapts to changing service availability by temporarily disabling retries when error rates exceed thresholds, preventing cascading failures and allowing the service to recover.
Streaming response handler with intelligent resume: This specifically handles broken response chunks by buffering successfully received chunks and resuming from the last received chunk, ensuring complete responses even with intermittent timeouts.
Comprehensive approach: The solution addresses all requirements: transient error handling, prevention of service overload, adaptation to changing availability, and support for response streaming.

Why Option A is incorrect:

Fixed 1-second delay doesn't adapt to service load
Simple restart of streams doesn't handle partial chunk delivery
Token capping doesn't address the adaptive requirements

Why Option C is incorrect:

Standard mode retry strategy isn't adaptive enough
Returning cached completions for failed streaming requests doesn't provide accurate responses
Global token limit doesn't handle varying inquiry lengths intelligently
Lightweight token management doesn't address the complex token-aware requirements

Key AWS Best Practices Applied:

Exponential backoff with jitter for retries
Circuit breaker pattern for fault tolerance
Intelligent streaming resume for partial responses
Adaptive strategies that respond to service health

Explanation:

Explanation

Why Option B is correct:

Adaptive retry strategy with exponential backoff and jitter: This addresses transient service errors during peak usage periods by gradually increasing wait times between retries, preventing overwhelming Amazon Bedrock with immediate retries.
Circuit breaker pattern: This adapts to changing service availability by temporarily disabling retries when error rates exceed thresholds, preventing cascading failures and allowing the service to recover.
Streaming response handler with intelligent resume: This specifically handles broken response chunks by buffering successfully received chunks and resuming from the last received chunk, ensuring complete responses even with intermittent timeouts.
Comprehensive approach: The solution addresses all requirements: transient error handling, prevention of service overload, adaptation to changing availability, and support for response streaming.

Why Option A is incorrect:

Fixed 1-second delay doesn't adapt to service load
Simple restart of streams doesn't handle partial chunk delivery
Token capping doesn't address the adaptive requirements

Why Option C is incorrect:

Standard mode retry strategy isn't adaptive enough
Returning cached completions for failed streaming requests doesn't provide accurate responses
Global token limit doesn't handle varying inquiry lengths intelligently
Lightweight token management doesn't address the complex token-aware requirements

Key AWS Best Practices Applied:

Exponential backoff with jitter for retries
Circuit breaker pattern for fault tolerance
Intelligent streaming resume for partial responses
Adaptive strategies that respond to service health

Comments (0)

No comments yet.

A company is building a generative AI (GenAI) application that uses Amazon Bedrock APIs to process complex customer inquiries. During peak usage periods, the application experiences intermittent API timeouts that cause issues such as broken response chunks and delayed data delivery. The application struggles to ensure that prompts remain within token limits when handling complex customer inquiries of varying lengths. Users have reported truncated inputs and incomplete responses. The company has also observed foundation model (FM) invocation failures.

The company needs a retry strategy that automatically handles transient service errors and prevents overwhelming Amazon Bedrock during peak usage periods. The strategy must also adapt to changing service availability and support response streaming and token-aware request handling.

Which solution will meet these requirements?

Real Exam

Community

DDucse

Last updated: March 23, 2026 at 10:56

Implement a standard retry strategy that uses a 1-second fixed delay between attempts and a 3-retry maximum for all errors. Handle streaming response timeouts by restarting streams. Cap token usage for each session.

5.9%

Implement an adaptive retry strategy that uses exponential backoff with jitter and a circuit breaker pattern that temporarily disables retries when error rates exceed a predefined threshold. Implement a streaming response handler that monitors for chunk delivery timeouts. Configure the handler to buffer successfully received chunks and intelligently resume streaming from the last received chunk when connections are re-established.

76.5%