Abstract While recent language models have the ability to take long contexts as input, relatively little is known about how well they use longer context. In particular, we observe that performance is often highest when relevant information occurs at the beginning or end of the input context, and significantly degrades when models must access relevant information in the middle of long contexts, even for explicitly long-context models.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Jan 01, 2024 |
| Journal | Transactions of the Association for Computational Linguistics |
| Authors | Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua |
| DOI | 10.1162/tacl_a_00638 |
| Citations | 887 |
| Source | OpenAlex |