
Answer-first summary for fast verification
Answer: Short-term working memory
The context window of a model is the amount of text, measured in tokens, that the model can process at once during inference. This is analogous to **short-term working memory** in human cognition because:

1. **Limited capacity**: Just as human working memory has limited capacity (often cited as 7±2 items), a model's context window has a fixed token limit.
2. **Temporary storage**: The context window holds information temporarily for processing, similar to how working memory holds information for immediate cognitive tasks.
3. **Active processing**: Both are used for active reasoning and computation rather than long-term storage.
4. **Direct accessibility**: Information in the context window is immediately accessible to the model, just as information in working memory is immediately accessible for cognitive processing.

**Why other options are incorrect**:

- **Long-term memory (A)**: This refers to permanent storage, while the context window is temporary.
- **Neural embedding space (C)**: This refers to the mathematical representation of concepts, not the processing capacity.
- **Batch processing unit (D)**: This relates to parallel processing of multiple inputs, not the immediate processing capacity of a single input sequence.
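The limited-capacity and temporary-storage points above can be illustrated with a toy buffer. This is only a sketch under stated assumptions: `ContextWindow` is a hypothetical class, whitespace-separated words stand in for real tokens, and dropping the oldest tokens is just one possible application-side strategy. Real systems may instead reject or truncate over-limit input in other ways.

```python
from collections import deque


class ContextWindow:
    """Toy fixed-capacity buffer (hypothetical, for illustration only).

    Assumption: whitespace-separated words stand in for model tokens.
    """

    def __init__(self, max_tokens: int):
        self.max_tokens = max_tokens
        # A deque with maxlen discards the oldest items once it is full.
        self.tokens = deque(maxlen=max_tokens)

    def add(self, text: str) -> None:
        # New tokens enter the window; the oldest fall out if capacity is exceeded.
        self.tokens.extend(text.split())

    def visible(self) -> list:
        # Everything still in the window is directly accessible.
        return list(self.tokens)


window = ContextWindow(max_tokens=5)
window.add("the quick brown fox")
window.add("jumps over the lazy dog")
print(window.visible())  # ['jumps', 'over', 'the', 'lazy', 'dog']
```

Once the limit of five tokens is reached, the earliest words are no longer available, which mirrors how content outside the window cannot be used by the model for the current input.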
Author: Ritesh Yadav