Home / Research Library / ResNeSt: Split-Attention Networks
🤖 Artificial Intelligence OpenAlex

ResNeSt: Split-Attention Networks

📅 June 1, 2022 👤 Hang Zhang, Chongruo Wu, Zhongyue Zhang et al. 📖 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 📊 1,278 citations

🤖 Plain-English Summary

The ability to learn richer network representations generally boosts the performance of deep learning models. Adding a Split-Attention module into the architecture design space of RegNet-Y and FBNetV2 directly improves the performance of the resulting network.

🔑 Key Findings

  • To improve representation-learning in convolutional neural networks, we present a multi-branch architecture, which applies channel-wise attention across different network branches to leverage the complementary strengths of both feature-map attention and multi-path representation.
  • Our proposed Split-Attention module provides a simple and modular computation block that can serve as a drop-in replacement for the popular residual block, while producing more diverse representations via cross-feature interactions.
  • Adding a Split-Attention module into the architecture design space of RegNet-Y and FBNetV2 directly improves the performance of the resulting network.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 01, 2022
Journal 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Authors Hang Zhang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Haibin Lin
DOI 10.1109/cvprw56347.2022.00309
Citations 1,278
Source OpenAlex

More 🤖 Artificial Intelligence Research