Home / Research Library / Efficient Transformers: A Survey
🤖 Artificial Intelligence OpenAlex

Efficient Transformers: A Survey

📅 April 22, 2022 👤 Yi Tay, Mostafa Dehghani, Dara Bahri et al. 📖 ACM Computing Surveys 📊 963 citations

🤖 Plain-English Summary

Transformer model architectures have garnered immense interest lately due to their effectiveness across a range of domains like language, vision, and reinforcement learning. Recently, a dizzying number of “X-former” models have been proposed—Reformer, Linformer, Performer, Longformer, to name a few—which improve upon the original Transformer architecture, many of which make improvements around computational and memory efficiency .

🔑 Key Findings

  • In the field of natural language processing for example, Transformers have become an indispensable staple in the modern deep learning stack.
  • Recently, a dizzying number of “X-former” models have been proposed—Reformer, Linformer, Performer, Longformer, to name a few—which improve upon the original Transformer architecture, many of which make improvements around computational and memory efficiency .
  • With the aim of helping the avid researcher navigate this flurry, this article characterizes a large and thoughtful selection of recent efficiency-flavored “X-former” models, providing an organized and comprehensive overview of existing work and models across multiple domains.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Apr 22, 2022
Journal ACM Computing Surveys
Authors Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler
DOI 10.1145/3530811
Citations 963
Source OpenAlex

More 🤖 Artificial Intelligence Research