AWS Certified Generative AI Developer - Professional

Get started today

Ultimate access to all questions.

Explanation:

Explanation

Option A is correct because it properly addresses the requirements for streaming responses:

Amazon API Gateway WebSocket API supports bidirectional communication, which is ideal for streaming data in real-time
AWS Lambda integration provides serverless compute to handle the Bedrock API calls
InvokeModelWithResponseStream API is specifically designed for streaming responses from Amazon Bedrock foundation models
Stream partial responses through WebSocket connections enables real-time delivery of response chunks as they become available

Why other options are incorrect:

Option B: Using REST API with polling every 100ms is inefficient and doesn't provide true streaming. It creates unnecessary network traffic and latency.

Option C: Direct client connections to Bedrock using IAM credentials is a security anti-pattern. Exposing IAM credentials to frontend clients is highly insecure and violates AWS security best practices.

Option D: Caching complete responses and serving them through paginated GET requests doesn't provide real-time streaming. This approach would wait for the entire response to complete before serving any part of it, defeating the purpose of streaming.

Key Concepts

Amazon Bedrock InvokeModelWithResponseStream: This API enables streaming responses from foundation models, allowing applications to receive and process responses incrementally
WebSocket APIs: Provide full-duplex communication channels over a single TCP connection, ideal for real-time applications
Security: Never expose AWS credentials to client-side applications; always use API Gateway or other secure proxy layers
Streaming vs Polling: Streaming provides immediate delivery of partial responses, while polling introduces latency and inefficiency

Explanation:

Explanation

Option A is correct because it properly addresses the requirements for streaming responses:

Amazon API Gateway WebSocket API supports bidirectional communication, which is ideal for streaming data in real-time
AWS Lambda integration provides serverless compute to handle the Bedrock API calls
InvokeModelWithResponseStream API is specifically designed for streaming responses from Amazon Bedrock foundation models
Stream partial responses through WebSocket connections enables real-time delivery of response chunks as they become available

Why other options are incorrect:

Option B: Using REST API with polling every 100ms is inefficient and doesn't provide true streaming. It creates unnecessary network traffic and latency.

Key Concepts

Amazon Bedrock InvokeModelWithResponseStream: This API enables streaming responses from foundation models, allowing applications to receive and process responses incrementally
WebSocket APIs: Provide full-duplex communication channels over a single TCP connection, ideal for real-time applications
Security: Never expose AWS credentials to client-side applications; always use API Gateway or other secure proxy layers
Streaming vs Polling: Streaming provides immediate delivery of partial responses, while polling introduces latency and inefficiency

Comments (0)

No comments yet.

A company is developing a customer support application that uses Amazon Bedrock foundation models (FMs) to provide real-time AI assistance to the company’s employees. The application must display AI- generated responses character by character as the responses are generated. The application needs to support thousands of concurrent users with minimal latency. The responses typically take 15 to 45 seconds to finish.

Which solution will meet these requirements?

Real Exam

Community

DDucse

Last updated: March 23, 2026 at 11:01

Configure an Amazon API Gateway WebSocket API with an AWS Lambda integration. Configure the WebSocket API to invoke the Amazon Bedrock InvokeModelWithResponseStream API and stream partial responses through WebSocket connections.

100.0%