As the core building block of vision transformers, attention is a powerful tool for capturing long-range dependencies. Empirical results across several computer vision tasks such as image classification, object detection, and semantic segmentation verify the effectiveness of our design.
This research advances how AI systems learn, reason, and solve problems, with direct implications for automation and scientific discovery.
| Field | Value |
| --- | --- |
| Category | 🤖 Artificial Intelligence |
| Published | Jun 01, 2023 |
| Journal | Research Journal |
| Authors | Lei Zhu, Xinjiang Wang, Zhanghan Ke, Wayne Zhang, Rynson W. H. Lau |
| DOI | 10.1109/cvpr52729.2023.00995 |
| Citations | 1,044 |
| Source | OpenAlex |