
Answer-first summary for fast verification
Answer: Use Retrieval Augmented Generation (RAG) with prompt engineering techniques.
## Detailed Explanation The question describes a scenario where a company needs a cost-effective AI/ML solution for customer service agents to handle frequently asked questions that may change over time. The key requirements are: 1. **Dynamic question handling** - Questions can evolve, so the solution must adapt without constant retraining. 2. **Cost-effectiveness** - The approach should minimize ongoing operational expenses. 3. **Automatic answer generation** - Agents should receive automatically generated responses. ### Analysis of Options: **A: Fine-tune the model regularly** - This involves retraining the model with new data whenever questions change. - **Disadvantages**: - High computational costs for frequent retraining. - Requires labeled datasets for each update. - Time-consuming and not scalable for rapidly changing questions. - Not cost-effective due to recurring training expenses. **B: Train the model by using context data** - This suggests training a model initially with contextual information. - **Disadvantages**: - Static approach; doesn't address evolving questions effectively. - Would require retraining when questions change, similar to option A. - Not optimized for dynamic content updates. **C: Pre-train and benchmark the model by using context data** - This focuses on initial model preparation and evaluation. - **Disadvantages**: - Primarily addresses initial setup, not ongoing adaptation to changing questions. - Benchmarking alone doesn't provide mechanisms for handling new questions. - Would still require retraining or additional solutions for evolving content. **D: Use Retrieval Augmented Generation (RAG) with prompt engineering techniques** - **Why this is optimal**: - **Dynamic content handling**: RAG combines a retrieval system with a generative model. The retrieval component can access updated knowledge sources (e.g., FAQs, documentation) without retraining the generative model. - **Cost-effectiveness**: - No need for frequent model retraining, reducing computational costs. - Updates can be made by modifying the knowledge base rather than retraining the entire model. - Leverages existing pre-trained models (like those in Amazon Bedrock) with minimal fine-tuning. - **Automatic answer generation**: The generative component produces answers based on retrieved relevant information. - **Prompt engineering**: Enhances response quality by crafting effective prompts that guide the model to generate accurate, context-aware answers. - **AWS best practice**: RAG is a recommended pattern in AWS AI/ML solutions for scenarios requiring up-to-date information without constant model retraining. ### Conclusion Retrieval Augmented Generation (RAG) with prompt engineering is the most cost-effective strategy because it separates the knowledge base from the generative model. This allows for easy updates to the question-answer content without expensive retraining cycles, while maintaining high-quality automatic answer generation. The other options involve more frequent and costly model updates, making them less suitable for this dynamic, cost-sensitive scenario.
Ultimate access to all questions.
No comments yet.
Author: LeetQuiz Editorial Team
A company has implemented an AI/ML solution to assist customer service agents by answering frequently asked questions. Since the questions may evolve, the company needs a cost-effective method to allow agents to query the system and get automatically generated responses to common customer inquiries.
Which approach will most cost-effectively fulfill these requirements?
A
Fine-tune the model regularly.
B
Train the model by using context data.
C
Pre-train and benchmark the model by using context data.
D
Use Retrieval Augmented Generation (RAG) with prompt engineering techniques.