Tuesday Jul 11, 2023
arXiv preprint - Lost in the Middle: How Language Models Use Long Contexts
In this episode, we discuss "Lost in the Middle: How Language Models Use Long Contexts" by Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni, and Percy Liang. The paper examines how context length and the position of relevant information affect language model performance on tasks such as multi-document question answering and key-value retrieval. The authors find that models perform best when the relevant information appears at the beginning or end of the input context, and struggle to use information located in the middle of a long context. Performance also decreases as the input context grows longer, even for models specifically designed for long-context processing.