Attention mechanisms in computer vision: A survey

📅 March 15, 2022 👤 Meng-Hao Guo, Tian-Xing Xu, Jiangjiang Liu et al. 📖 Computational Visual Media 📊 2,357 citations

🤖 Plain-English Summary

Humans can naturally and effectively find salient regions in complex scenes. This survey provides a comprehensive review of attention mechanisms in computer vision and categorizes them by approach, such as channel attention, spatial attention, temporal attention, and branch attention. A related repository, https://github.com/MenghaoGuo/Awesome-Vision-Attentions, collects related work.

🔑 Key Findings

  • Motivated by the human ability to find salient regions in complex scenes, attention mechanisms were introduced into computer vision to imitate this aspect of the human visual system.
  • Such an attention mechanism can be regarded as a dynamic weight adjustment process based on features of the input image.
  • Attention mechanisms have achieved great success in many visual tasks, including image classification, object detection, semantic segmentation, video understanding, image generation, 3D vision, multimodal tasks, and self-supervised learning.
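
The "dynamic weight adjustment" idea in the second finding can be sketched in a few lines. The example below is a minimal illustration in plain Python of squeeze-and-excitation style channel attention, one form of the channel attention the survey covers. The function name `channel_attention`, the toy weight matrices `w1` and `w2`, and the 2x2 feature maps are assumptions made here for illustration; this is not code from the paper, and real models learn the weight matrices during training.

```python
import math


def sigmoid(x):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(features, w1, w2):
    """Rescale each channel by a weight computed from the input itself.

    features: list of C channels, each an H x W nested list of floats.
    w1: hidden x C matrix, w2: C x hidden matrix (hypothetical toy weights).
    Returns (rescaled features, per-channel attention weights).
    """
    # Squeeze: summarize each channel by its global average.
    descriptor = [
        sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in features
    ]
    # Excitation: a small two-layer network (ReLU, then sigmoid) turns the
    # summary into one weight in (0, 1) per channel.
    hidden = [max(0.0, sum(w * d for w, d in zip(row, descriptor))) for row in w1]
    weights = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w2]
    # Reweight: multiply every value in a channel by that channel's weight.
    scaled = [
        [[value * a for value in row] for row in ch]
        for ch, a in zip(features, weights)
    ]
    return scaled, weights


# Toy input: two channels of size 2 x 2.
features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.5, 0.5], [0.5, 0.5]],
]
w1 = [[0.5, -0.25]]    # 1 hidden unit, 2 input channels
w2 = [[1.0], [-1.0]]   # 2 output channels, 1 hidden unit

scaled, weights = channel_attention(features, w1, w2)
print(weights)
```

Because the weights are recomputed from each input's own features, a different image yields different channel weights, which is what distinguishes attention from a fixed, learned scaling.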

💡 Why This Matters

By organizing attention mechanisms for vision into a single taxonomy, the survey makes it easier to compare methods across the tasks where they are used, from image classification and object detection to video understanding and 3D vision.

📋 Article Details

Category: 🤖 Artificial Intelligence
Published: Mar 15, 2022
Journal: Computational Visual Media
Authors: Meng-Hao Guo, Tian-Xing Xu, Jiangjiang Liu, Zheng-Ning Liu, Peng-Tao Jiang
DOI: 10.1007/s41095-022-0271-y
Citations: 2,357
Source: OpenAlex
