Compared to the great progress of large-scale vision transformers (ViTs) in recent years, large-scale models based on convolutional neural networks (CNNs) are still in an early state. The effectiveness of our model is proven on challenging benchmarks including ImageNet, COCO, andADE20K.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Jun 01, 2023 |
| Journal | Research Journal |
| Authors | Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li |
| DOI | 10.1109/cvpr52729.2023.01385 |
| Citations | 894 |
| Source | OpenAlex |