Home / Research Articles Hub / Beyond Self-Attention: External Attention Using Tw...
🤖 Artificial Intelligence OpenAlex

Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks

📅 Published: January 1, 2022 👤 Meng-Hao Guo, Zheng-Ning Liu, Tai‐Jiang Mu et al. 📖 IEEE Transactions on Pattern Analysis and Machine Intelligence 📊 622 citations
AI-Generated Summary

Attention mechanisms, especially self-attention, have played an increasingly important role in deep feature representation for visual tasks. We further incorporate the multi-head mechanism into external attention to provide an all-MLP architecture, external attention MLP (EAMLP), for image classification.

⚡ This is an original paraphrased summary — not copied from the abstract. Full paper available at the source link below.

Key Findings
  • 1 Self-attention updates the feature at each position by computing a weighted sum of features using pair-wise affinities across all positions to capture the long-range dependency within a single sample.
  • 2 However, self-attention has quadratic complexity and ignores potential correlation between different samples.
  • 3 This article proposes a novel attention mechanism which we call external attention, based on two external, small, learnable, shared memories, which can be implemented easily by simply using two cascaded linear layers and two normalization layers; it conveniently replaces self-attention in existing popular architectures.
Why It Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

This summary is based on publicly available metadata and abstract. For the full research paper, visit the original source:

Read Full Paper at OpenAlex
More Artificial Intelligence Papers ← Back to Hub 📚 Learning Hub
Article Details
Source OpenAlex
Category 🤖 Artificial Intelligence
Published Jan 1, 2022
Journal IEEE Transactions on Pattern Analysis and Machine Intelligence
DOI 10.1109/tpami.2022.3211006
Citations 622
Authors Meng-Hao Guo, Zheng-Ning Liu, Tai‐Jiang Mu, Shi‐Min Hu