Home / Research Library / Scaling Instruction-Finetuned Language Models
🤖 Artificial Intelligence OpenAlex

Scaling Instruction-Finetuned Language Models

📅 October 20, 2022 👤 Hyung Won Chung, Le Hou, Shayne Longpre et al. 📖 arXiv (Cornell University) 📊 1,192 citations

🤖 Plain-English Summary

Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks. We also publicly release Flan-T5 checkpoints, which achieve strong few-shot performance even compared to much larger models, such as PaLM 62B.

🔑 Key Findings

  • In this paper we explore instruction finetuning with a particular focus on (1) scaling the number of tasks, (2) scaling the model size, and (3) finetuning on chain-of-thought data.
  • We find that instruction finetuning with the above aspects dramatically improves performance on a variety of model classes (PaLM, T5, U-PaLM), prompting setups (zero-shot, few-shot, CoT), and evaluation benchmarks (MMLU, BBH, TyDiQA, MGSM, open-ended generation).
  • For instance, Flan-PaLM 540B instruction-finetuned on 1.8K tasks outperforms PALM 540B by a large margin (+9.4% on average).

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Oct 20, 2022
Journal arXiv (Cornell University)
Authors Hyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay
DOI 10.48550/arxiv.2210.11416
Citations 1,192
Source OpenAlex

More 🤖 Artificial Intelligence Research