Home / Research Library / ResMLP: Feedforward Networks for Image Classificat...
🤖 Artificial Intelligence OpenAlex

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

📅 September 12, 2022 👤 Hugo Touvron, Piotr Bojanowski, Mathilde Caron et al. 📖 IEEE Transactions on Pattern Analysis and Machine Intelligence 📊 763 citations

🤖 Plain-English Summary

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. Finally, by adapting our model to machine translation we achieve surprisingly good results.

🔑 Key Findings

  • It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch.
  • When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet.
  • We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Sep 12, 2022
Journal IEEE Transactions on Pattern Analysis and Machine Intelligence
Authors Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby
DOI 10.1109/tpami.2022.3206148
Citations 763
Source OpenAlex

More 🤖 Artificial Intelligence Research