Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks. We also publicly release Flan-T5 checkpoints, which achieve strong few-shot performance even compared to much larger models, such as PaLM 62B.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Oct 20, 2022 |
| Journal | arXiv (Cornell University) |
| Authors | Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay |
| DOI | 10.48550/arxiv.2210.11416 |
| Citations | 1,192 |
| Source | OpenAlex |