AWS Certified AI Practitioner

Get started today

Ultimate access to all questions.

Explanation:

Detailed Explanation

The question describes a scenario where a company needs a cost-effective AI/ML solution for customer service agents to handle frequently asked questions that may change over time. The key requirements are:

Dynamic question handling - Questions can evolve, so the solution must adapt without constant retraining.
Cost-effectiveness - The approach should minimize ongoing operational expenses.
Automatic answer generation - Agents should receive automatically generated responses.

Analysis of Options:

A: Fine-tune the model regularly

This involves retraining the model with new data whenever questions change.
Disadvantages:
- High computational costs for frequent retraining.
- Requires labeled datasets for each update.
- Time-consuming and not scalable for rapidly changing questions.
- Not cost-effective due to recurring training expenses.

B: Train the model by using context data

This suggests training a model initially with contextual information.
Disadvantages:
- Static approach; doesn't address evolving questions effectively.
- Would require retraining when questions change, similar to option A.
- Not optimized for dynamic content updates.

C: Pre-train and benchmark the model by using context data

This focuses on initial model preparation and evaluation.
Disadvantages:
- Primarily addresses initial setup, not ongoing adaptation to changing questions.
- Benchmarking alone doesn't provide mechanisms for handling new questions.
- Would still require retraining or additional solutions for evolving content.

D: Use Retrieval Augmented Generation (RAG) with prompt engineering techniques

Why this is optimal:
- Dynamic content handling: RAG combines a retrieval system with a generative model. The retrieval component can access updated knowledge sources (e.g., FAQs, documentation) without retraining the generative model.
- Cost-effectiveness:
  - No need for frequent model retraining, reducing computational costs.
  - Updates can be made by modifying the knowledge base rather than retraining the entire model.
  - Leverages existing pre-trained models (like those in Amazon Bedrock) with minimal fine-tuning.
- Automatic answer generation: The generative component produces answers based on retrieved relevant information.
- Prompt engineering: Enhances response quality by crafting effective prompts that guide the model to generate accurate, context-aware answers.
- AWS best practice: RAG is a recommended pattern in AWS AI/ML solutions for scenarios requiring up-to-date information without constant model retraining.

Conclusion

Retrieval Augmented Generation (RAG) with prompt engineering is the most cost-effective strategy because it separates the knowledge base from the generative model. This allows for easy updates to the question-answer content without expensive retraining cycles, while maintaining high-quality automatic answer generation. The other options involve more frequent and costly model updates, making them less suitable for this dynamic, cost-sensitive scenario.

Explanation:

Detailed Explanation

Dynamic question handling - Questions can evolve, so the solution must adapt without constant retraining.
Cost-effectiveness - The approach should minimize ongoing operational expenses.
Automatic answer generation - Agents should receive automatically generated responses.

Analysis of Options:

A: Fine-tune the model regularly

This involves retraining the model with new data whenever questions change.
Disadvantages:
- High computational costs for frequent retraining.
- Requires labeled datasets for each update.
- Time-consuming and not scalable for rapidly changing questions.
- Not cost-effective due to recurring training expenses.

B: Train the model by using context data

This suggests training a model initially with contextual information.
Disadvantages:
- Static approach; doesn't address evolving questions effectively.
- Would require retraining when questions change, similar to option A.
- Not optimized for dynamic content updates.

C: Pre-train and benchmark the model by using context data

This focuses on initial model preparation and evaluation.
Disadvantages:
- Primarily addresses initial setup, not ongoing adaptation to changing questions.
- Benchmarking alone doesn't provide mechanisms for handling new questions.
- Would still require retraining or additional solutions for evolving content.

D: Use Retrieval Augmented Generation (RAG) with prompt engineering techniques

Why this is optimal:
- Dynamic content handling: RAG combines a retrieval system with a generative model. The retrieval component can access updated knowledge sources (e.g., FAQs, documentation) without retraining the generative model.
- Cost-effectiveness:
  - No need for frequent model retraining, reducing computational costs.
  - Updates can be made by modifying the knowledge base rather than retraining the entire model.
  - Leverages existing pre-trained models (like those in Amazon Bedrock) with minimal fine-tuning.
- Automatic answer generation: The generative component produces answers based on retrieved relevant information.
- Prompt engineering: Enhances response quality by crafting effective prompts that guide the model to generate accurate, context-aware answers.
- AWS best practice: RAG is a recommended pattern in AWS AI/ML solutions for scenarios requiring up-to-date information without constant model retraining.

Conclusion

Comments (0)

No comments yet.

A company has implemented an AI/ML solution to assist customer service agents by answering frequently asked questions. Since the questions may evolve, the company needs a cost-effective method to allow agents to query the system and get automatically generated responses to common customer inquiries.

Which approach will most cost-effectively fulfill these requirements?

Exam-Like

Last updated: April 22, 2026 at 14:02

Fine-tune the model regularly.

10.0%

Train the model by using context data.

0.0%

Pre-train and benchmark the model by using context data.

Use Retrieval Augmented Generation (RAG) with prompt engineering techniques.