
Answer-first summary for fast verification
Answer: Create pairs of questions and answers that specifically address topics related to the company's industry domain.
Instruction-based fine-tuning requires training data structured as input-output pairs where the input is an instruction or question and the output is the desired response. This approach teaches the model to follow specific instructions within a domain. **Why C is correct:** - Creating pairs of questions and answers that address domain-specific topics directly provides the structured input-output format needed for instruction-based fine-tuning. - This approach explicitly teaches the model how to respond to domain-relevant queries, aligning with the goal of question answering in a specific domain. - The question-answer pairs serve as clear examples of the desired behavior, enabling the model to learn the mapping between domain-specific instructions and appropriate responses. **Why other options are less suitable:** - **A:** Gathering and merging documents into a single file provides unstructured data, not the structured instruction-response pairs required for instruction-based fine-tuning. - **B:** Collecting and labeling reviews as positive/negative is suitable for sentiment analysis tasks, not for instruction-based question answering. - **D:** Creating few-shot prompts is useful for inference-time prompting but doesn't constitute training data preparation for fine-tuning; fine-tuning requires actual training examples, not just inference prompts.
Author: LeetQuiz Editorial Team
Ultimate access to all questions.
No comments yet.
A company plans to perform instruction-based fine-tuning of a foundation model (FM) for question answering within a specific domain.
How should the company structure the training data for this task?
A
Gather company internal documents and industry-specific materials. Merge the documents and materials into a single file.
B
Collect external company reviews from various online sources. Manually label each review as either positive or negative.
C
Create pairs of questions and answers that specifically address topics related to the company's industry domain.
D
Create few-shot prompts to instruct the model to answer only domain knowledge.