Transformer architectures have become the model of choice in natural language processing and are now being introduced into computer vision tasks such as image classification, object detection, and semantic segmentation. Extensive experiments show that PoseFormer achieves advanced performance on both datasets.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Oct 01, 2021 |
| Journal | 2021 IEEE/CVF International Conference on Computer Vision (ICCV) |
| Authors | Ce Zheng, Sijie Zhu, Matías Mendieta, Taojiannan Yang, Chen Chen |
| DOI | 10.1109/iccv48922.2021.01145 |
| Citations | 650 |
| Source | OpenAlex |