Large language models (LMs) are able to in-context learn—perform a new task via inference alone by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs. Instead, we find that other aspects of the demonstrations are the key drivers of endtask performance, including the fact that they provide a few examples of (1) the label space, (2) the distribution of the input text, and (3) the overall format of the sequence.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Jan 01, 2022 |
| Journal | Research Journal |
| Authors | Sewon Min, Xinxi Lyu, Ari Holtzman, Mikel Artetxe, Mike Lewis |
| DOI | 10.18653/v1/2022.emnlp-main.759 |
| Citations | 640 |
| Source | OpenAlex |