Remarkable effectiveness of the channel or spatial attention mechanisms for producing more discernible feature representation are illustrated in various computer vision tasks. Specifically, apart from encoding the global information to re-calibrate the channel-wise weight in each parallel branch, the output features of the two parallel branches are further aggregated by a cross-dimension interaction method.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | May 05, 2023 |
| Journal | Research Journal |
| Authors | Daliang Ouyang, Su He, Guozhong Zhang, Mingzhu Luo, Huaiyong Guo |
| DOI | 10.1109/icassp49357.2023.10096516 |
| Citations | 1,545 |
| Source | OpenAlex |