Transformer with self-attention has led to the revolutionizing of natural language processing field, and recently inspires the emergence of Transformer-style architecture design with competitive results in numerous computer vision tasks. Through extensive experiments over a wide range of applications (e.g., image recognition, object detection, instance segmentation, and semantic segmentation), we validate the superiority of CoTNet as a stronger backbone.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Apr 01, 2022 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Authors | Yehao Li, Ting Yao, Yingwei Pan, Tao Mei |
| DOI | 10.1109/tpami.2022.3164083 |
| Citations | 698 |
| Source | OpenAlex |