Home / Research Library / Visual attention network
🤖 Artificial Intelligence OpenAlex

Visual attention network

📅 July 28, 2023 👤 Meng-Hao Guo, Cheng-Ze Lu, Zheng-Ning Liu et al. 📖 Computational Visual Media 📊 972 citations

🤖 Plain-English Summary

While originally designed for natural language processing tasks, the self-attention mechanism has recently taken various computer vision areas by storm. It provides a novel method and a simple yet strong baseline for the community.

🔑 Key Findings

  • However, the 2D nature of images brings three challenges for applying self-attention in computer vision: (1) treating images as 1D sequences neglects their 2D structures; (2) the quadratic complexity is too expensive for high-resolution images; (3) it only captures spatial adaptability but ignores channel adaptability.
  • In this paper, we propose a novel linear attention named large kernel attention (LKA) to enable self-adaptive and long-range correlations in self-attention while avoiding its shortcomings.
  • Additionally, we present a neural network based on LKA, namely Visual Attention Network (VAN).

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jul 28, 2023
Journal Computational Visual Media
Authors Meng-Hao Guo, Cheng-Ze Lu, Zheng-Ning Liu, Ming‐Ming Cheng, Shi‐Min Hu
DOI 10.1007/s41095-023-0364-2
Citations 972
Source OpenAlex

More 🤖 Artificial Intelligence Research