
Ultimate access to all questions.
A Generative AI Engineer is developing a RAG application to answer questions about internal documents for the company SnoPen AI. The source documents may contain a substantial amount of irrelevant content, including advertisements, sports news, entertainment news, or information about other companies. Which approach is recommended for building the RAG application to effectively filter out this irrelevant information?
A
Keep all articles because the RAG application needs to understand non-company content to avoid answering questions about them.
B
Include in the system prompt that any information it sees will be about SnoPenAI, even if no data filtering is performed.
C
Include in the system prompt that the application is not supposed to answer any questions unrelated to SnoPen AI.
D
Consolidate all SnoPen AI related documents into a single chunk in the vector database.