Explanation
In Amazon Bedrock Knowledge Bases, source documents (such as PDFs, text files, and JSON files) are typically stored in Amazon S3 buckets.
Key Points:
- Amazon S3 (Simple Storage Service) is AWS's object storage service designed for storing and retrieving any amount of data from anywhere.
- When creating a knowledge base in Amazon Bedrock, you configure a data source that points to the S3 bucket (and optionally a prefix within it) where your source documents are stored.
- Bedrock then ingests these documents from S3, splits them into chunks, generates vector embeddings with an embedding model, and stores those embeddings in a vector store (such as Amazon OpenSearch Serverless or Pinecone).
- The other options are not typically used for storing source documents in a Bedrock knowledge base:
  - DynamoDB: A NoSQL key-value and document database. Items are limited to 400 KB, so it is a poor fit for raw source files.
  - Kinesis Data Streams: A real-time data streaming service. It retains records only for a limited period and is not a document store.
  - RDS MySQL: A relational database service, not optimized for storing large documents like PDFs.
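The flow described above (point a knowledge base at an S3 bucket, then ingest) can be sketched with boto3's `bedrock-agent` client. This is a minimal sketch, not a definitive setup: it assumes the knowledge base and its vector store already exist, and the bucket name, prefix, data source name, and knowledge base ID are hypothetical placeholders.

```python
def build_s3_data_source_config(bucket_name, prefix=None):
    """Build the dataSourceConfiguration payload for an S3-backed data source."""
    s3_config = {"bucketArn": f"arn:aws:s3:::{bucket_name}"}
    if prefix:
        # Restrict ingestion to objects under this prefix.
        s3_config["inclusionPrefixes"] = [prefix]
    return {"type": "S3", "s3Configuration": s3_config}


def attach_bucket_and_ingest(client, knowledge_base_id, bucket_name,
                             prefix=None, data_source_name="source-documents"):
    """Register the bucket as a data source, then start an ingestion job.

    `client` is expected to be a boto3 "bedrock-agent" client.
    Returns (data_source_id, ingestion_job_id).
    """
    created = client.create_data_source(
        knowledgeBaseId=knowledge_base_id,
        name=data_source_name,  # hypothetical name, chosen for illustration
        dataSourceConfiguration=build_s3_data_source_config(bucket_name, prefix),
    )
    data_source_id = created["dataSource"]["dataSourceId"]

    # Ingestion reads the documents from S3, chunks and embeds them,
    # and writes the embeddings to the knowledge base's vector store.
    job = client.start_ingestion_job(
        knowledgeBaseId=knowledge_base_id,
        dataSourceId=data_source_id,
    )
    return data_source_id, job["ingestionJob"]["ingestionJobId"]


def make_client(region_name=None):
    """Create the boto3 client lazily so the helpers above stay testable offline."""
    import boto3
    return boto3.client("bedrock-agent", region_name=region_name)


# Example call (placeholders, requires AWS credentials and an existing knowledge base):
# attach_bucket_and_ingest(make_client(), "KB_ID_PLACEHOLDER", "my-kb-source-docs", "manuals/")
```

Passing the client in as a parameter keeps the S3 configuration logic separate from AWS access, so the payload can be inspected or tested without credentials. Note that the documents themselves stay in S3; only their embeddings are written to the vector store.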
Why S3 is the correct choice:
- S3 provides scalable, durable, and secure storage for unstructured data
- It's cost-effective for storing large volumes of documents
- S3 is a natively supported data source for Bedrock knowledge bases and integrates with other AWS services
- Documents in S3 can be organized with prefixes, versioned, and access-controlled with IAM and bucket policies