
Explanation:
Option A is correct because it properly addresses the requirements for streaming responses:
Why other options are incorrect:
Option B: Using REST API with polling every 100ms is inefficient and doesn't provide true streaming. It creates unnecessary network traffic and latency.
Option C: Direct client connections to Bedrock using IAM credentials is a security anti-pattern. Exposing IAM credentials to frontend clients is highly insecure and violates AWS security best practices.
Option D: Caching complete responses and serving them through paginated GET requests doesn't provide real-time streaming. This approach would wait for the entire response to complete before serving any part of it, defeating the purpose of streaming.
Ultimate access to all questions.
No comments yet.
A company is developing a customer support application that uses Amazon Bedrock foundation models (FMs) to provide real-time AI assistance to the company’s employees. The application must display AI- generated responses character by character as the responses are generated. The application needs to support thousands of concurrent users with minimal latency. The responses typically take 15 to 45 seconds to finish.
Which solution will meet these requirements?
A
Configure an Amazon API Gateway WebSocket API with an AWS Lambda integration. Configure the WebSocket API to invoke the Amazon Bedrock InvokeModelWithResponseStream API and stream partial responses through WebSocket connections.
B
Configure an Amazon API Gateway REST API with an AWS Lambda integration. Configure the REST API to invoke the Amazon Bedrock standard InvokeModel API and implement frontend client-side polling every 100 ms for complete response chunks.
C
Implement direct frontend client connections to Amazon Bedrock by using IAM user credentials and the InvokeModelWithResponseStream API without any intermediate gateway or proxy layer.
D
Configure an Amazon API Gateway HTTP API with an AWS Lambda integration. Configure the HTTP API to cache complete responses in an Amazon DynamoDB table and serve the responses through multiple paginated GET requests to frontend clients.