
Answer-first summary for fast verification
Answer: Keep all news articles because the RAG application needs to understand non-technological content to avoid answering questions about them.
The question asks which approach is NOT advisable for building a RAG application focused on technology-only questions. Option C suggests keeping all news articles (including irrelevant content like sports and entertainment) so the RAG application can 'understand' non-technological content to avoid answering questions about them. This is not advisable because: (1) It introduces noise that can degrade retrieval quality and increase the chance of retrieving irrelevant documents when answering technology questions; (2) It wastes computational resources processing and storing irrelevant content; (3) The community discussion with 100% consensus and upvotes supports that filtering irrelevant content is better, with the comment 'garbage in, garbage out' highlighting why keeping irrelevant articles is problematic. In contrast, options A (system prompt filtering), B (filtering in retrieval), and D (upstream filtering) are all valid approaches to handle irrelevant content in a focused RAG application.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
When building a RAG application specifically for answering technology-related questions from news articles that contain significant irrelevant content (e.g., advertisements, sports, entertainment), which of the following approaches is NOT advisable?
A
Include in the system prompt that the application is not supposed to answer any questions unrelated to technology.
B
Filter out irrelevant news articles in the retrieval process.
C
Keep all news articles because the RAG application needs to understand non-technological content to avoid answering questions about them.
D
Filter out irrelevant news articles in the upstream document database.
No comments yet.