
**Answer:** Amazon Aurora PostgreSQL
## Detailed Explanation

### Question Analysis

The question asks for an AWS database service that can **store and query vector embeddings** from a generative AI model, specifically for a conversational search application. Vector embeddings are high-dimensional numerical representations of data (such as text, images, or audio) that capture semantic meaning, enabling the similarity searches that are a core requirement for intelligent agents and conversational interfaces.

### Evaluation of Options

**A: Amazon Athena** - **Not suitable**: Athena is a serverless interactive query service for analyzing data in Amazon S3 using standard SQL. It is designed for analytics on structured and semi-structured data stored in object storage, not for real-time storage and querying of vector embeddings. It lacks the native vector data type support and efficient similarity search capabilities required for this use case.

**B: Amazon Aurora PostgreSQL** - **Optimal choice**: Aurora PostgreSQL is a fully managed relational database service compatible with PostgreSQL. It supports the **pgvector extension**, which adds vector data types and efficient similarity search operations (e.g., cosine similarity, Euclidean distance). This makes it ideal for storing and querying high-dimensional embeddings from generative AI models. It provides:

- Native vector storage with optimized indexing (e.g., IVFFlat, HNSW).
- Integration with AI/ML workflows for real-time retrieval in conversational applications.
- Scalability, durability, and managed operations typical of AWS database services.

**C: Amazon Redshift** - **Less suitable**: Redshift is a cloud data warehouse optimized for large-scale analytics and complex queries on structured data. While it supports some vector operations through extensions or custom functions, it is not purpose-built for real-time vector similarity searches. Its architecture is geared toward batch processing and analytical workloads, not the low-latency retrieval needed for conversational agents.

**D: Amazon EMR** - **Not suitable**: EMR (Elastic MapReduce) is a big data platform for processing large datasets using frameworks such as Apache Spark and Hadoop. It is designed for data transformation, batch processing, and machine learning pipelines, not for serving as a database for real-time vector queries. It lacks built-in vector storage and query capabilities.

### Conclusion

Amazon Aurora PostgreSQL (option B) is the best fit because it directly addresses the requirement through PostgreSQL's pgvector extension, enabling efficient storage and similarity searches of vector embeddings. This aligns with AWS best practices for building AI-powered applications, where managed databases with vector support reduce operational overhead and ensure performance for conversational search experiences.
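To make the pgvector similarity search concrete, here is a minimal, self-contained sketch. The SQL string shows the shape of a typical pgvector nearest-neighbor query (the `documents` table, its columns, and the parameter name are hypothetical, not from the source); the pure-Python functions below it reproduce the cosine-distance ranking that pgvector's `<=>` operator performs inside the database, using toy 3-dimensional embeddings so the example runs without any database.

```python
import math

# Illustrative pgvector SQL (hypothetical `documents` table and columns).
# `<=>` is pgvector's cosine-distance operator; the query returns the
# rows whose stored embeddings are closest to the query embedding.
PGVECTOR_QUERY = """
SELECT id, content
FROM documents
ORDER BY embedding <=> %(query_embedding)s
LIMIT 5;
"""

def cosine_distance(a, b):
    """Cosine distance (1 - cosine similarity), the metric behind `<=>`."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return 1.0 - dot / (norm_a * norm_b)

def nearest(query, corpus, k=2):
    """Return the k corpus document ids closest to the query embedding."""
    ranked = sorted(corpus.items(), key=lambda kv: cosine_distance(query, kv[1]))
    return [doc_id for doc_id, _ in ranked[:k]]

# Toy 3-dimensional "embeddings"; real embedding models emit hundreds
# or thousands of dimensions.
corpus = {
    "refund-policy":  [0.9, 0.1, 0.0],
    "shipping-times": [0.1, 0.9, 0.1],
    "return-label":   [0.8, 0.2, 0.1],
}
query = [0.85, 0.15, 0.05]  # e.g., the embedding of "how do I return an item?"

print(nearest(query, corpus))  # → ['refund-policy', 'return-label']
```

In a conversational search application, the application embeds the user's utterance, and Aurora PostgreSQL with pgvector performs exactly this ranking server-side, accelerated by an IVFFlat or HNSW index rather than the exhaustive sort shown here.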
Author: LeetQuiz Editorial Team
**Question:** Which AWS service should be used to store and query vector embeddings from a generative AI model for a conversational search application?

- A. Amazon Athena
- B. Amazon Aurora PostgreSQL
- C. Amazon Redshift
- D. Amazon EMR