Transformer model architectures have garnered immense interest lately due to their effectiveness across a range of domains like language, vision, and reinforcement learning. Recently, a dizzying number of “X-former” models have been proposed—Reformer, Linformer, Performer, Longformer, to name a few—which improve upon the original Transformer architecture, many of which make improvements around computational and memory efficiency .
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Apr 22, 2022 |
| Journal | ACM Computing Surveys |
| Authors | Yi Tay, Mostafa Dehghani, Dara Bahri, Donald Metzler |
| DOI | 10.1145/3530811 |
| Citations | 963 |
| Source | OpenAlex |