Home / Research Library / LLaMA: Open and Efficient Foundation Language Mode...
🤖 Artificial Intelligence OpenAlex

LLaMA: Open and Efficient Foundation Language Models

📅 February 27, 2023 👤 Hugo Touvron, Thibaut Lavril, Gautier Izacard et al. 📖 arXiv (Cornell University) 📊 3,897 citations

🤖 Plain-English Summary

We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. In particular, LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA-65B is competitive with the best models, Chinchilla-70B and PaLM-540B.

🔑 Key Findings

  • We train our models on trillions of tokens, and show that it is possible to train advanced models using publicly available datasets exclusively, without resorting to proprietary and inaccessible datasets.
  • In particular, LLaMA-13B outperforms GPT-3 (175B) on most benchmarks, and LLaMA-65B is competitive with the best models, Chinchilla-70B and PaLM-540B.
  • We release all our models to the research community.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Feb 27, 2023
Journal arXiv (Cornell University)
Authors Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux
DOI 10.48550/arxiv.2302.13971
Citations 3,897
Source OpenAlex

More 🤖 Artificial Intelligence Research