Home / Research Library / Emergent Abilities of Large Language Models
🤖 Artificial Intelligence OpenAlex

Emergent Abilities of Large Language Models

📅 June 15, 2022 👤 Wei, Jason, Yi Tay, Rishi Bommasani et al. 📖 arXiv (Cornell University) 📊 1,030 citations

🤖 Plain-English Summary

Scaling up language models has been shown to predictably improve performance and sample efficiency on a wide range of downstream tasks. Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models.

🔑 Key Findings

  • This paper instead discusses an unpredictable phenomenon that we refer to as emergent abilities of large language models.
  • We consider an ability to be emergent if it is not present in smaller models but is present in larger models.
  • Thus, emergent abilities cannot be predicted simply by extrapolating the performance of smaller models.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 15, 2022
Journal arXiv (Cornell University)
Authors Wei, Jason, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph
DOI 10.48550/arxiv.2206.07682
Citations 1,030
Source OpenAlex

More 🤖 Artificial Intelligence Research