Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks

🤖 Plain-English Summary

Attention mechanisms, especially self-attention, have played an increasingly important role in deep feature representation for visual tasks. The authors also incorporate the multi-head mechanism into external attention, yielding an all-MLP architecture for image classification called external attention MLP (EAMLP).

🔑 Key Findings

Self-attention updates the feature at each position by computing a weighted sum of features using pair-wise affinities across all positions to capture the long-range dependency within a single sample.
However, self-attention has quadratic complexity in the number of positions and ignores potential correlations between different samples.
The article proposes a new attention mechanism, external attention, based on two small, learnable, shared external memories. It can be implemented with two cascaded linear layers and two normalization layers, and it can replace self-attention in existing popular architectures.
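The "two cascaded linear layers and two normalization layers" description can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the function names (`external_attention_map`, `external_attention`), the memory names (`mem_k`, `mem_v`), the `eps` constant, and the toy sizes are all hypothetical. The choice of normalizations (a softmax over positions followed by an L1 normalization over memory slots) is an assumption that this summary does not spell out.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]


def softmax(values):
    """Numerically stable softmax over a flat list of numbers."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def external_attention_map(features, mem_k, eps=1e-9):
    """Return the N x S attention map between N positions and S memory slots."""
    # First linear layer: affinity of every position to every slot of the
    # shared key memory (features is N x d, mem_k is S x d).
    attn = matmul(features, transpose(mem_k))
    # First normalization (assumed): softmax over positions, one memory slot at a time.
    attn = transpose([softmax(col) for col in transpose(attn)])
    # Second normalization (assumed): L1-normalize each position over the memory slots.
    return [[a / (eps + sum(row)) for a in row] for row in attn]


def external_attention(features, mem_k, mem_v, eps=1e-9):
    """Update N x d features by reading from the shared value memory (S x d)."""
    attn = external_attention_map(features, mem_k, eps)
    # Second linear layer: weighted sum of value-memory rows for each position.
    return matmul(attn, mem_v)


# Toy sizes: N = 6 positions, d = 4 channels, S = 3 memory slots.
features = [[0.1 * (i + j) for j in range(4)] for i in range(6)]
mem_k = [[0.05 * (s - j) for j in range(4)] for s in range(3)]
mem_v = [[0.2 * (s + j) for j in range(4)] for s in range(3)]

out = external_attention(features, mem_k, mem_v)
print(len(out), len(out[0]))  # 6 4
```

In this sketch the attention map has shape N x S rather than the N x N map that pairwise self-attention would need, so its cost grows linearly with the number of positions when S is fixed. The memories are also independent of the input, which is what lets them be shared across samples; in a trained model they would be learned parameters rather than the fixed toy values used here.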

💡 Why This Matters

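Because external attention relies on two small shared memories instead of pair-wise affinities across all positions, it avoids the quadratic cost of self-attention and can take correlations between different samples into account. It can also stand in for self-attention in existing architectures for visual tasks.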

📋 Article Details

Category	🤖 Artificial Intelligence
Published	Jan 01, 2022
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Authors	Meng-Hao Guo, Zheng-Ning Liu, Tai-Jiang Mu, Shi-Min Hu
DOI	10.1109/tpami.2022.3211006
Citations	622
Source	OpenAlex

Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks
