Home / Research Library / Mobile-Former: Bridging MobileNet and Transformer
🤖 Artificial Intelligence OpenAlex

Mobile-Former: Bridging MobileNet and Transformer

📅 June 1, 2022 👤 Yinpeng Chen, Xiyang Dai, Dongdong Chen et al. 📖 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 📊 609 citations

🤖 Plain-English Summary

We present Mobile-Former, a parallel design of MobileNet and transformer with a two-way bridge in between. Additionally, we build an efficient end-to-end detector by replacing backbone, encoder and decoder in DETR with Mobile-Former, which outperforms DETR by 1.3 AP but saves 52% of computational cost and 36% of parameters.

🔑 Key Findings

  • This structure leverages the advantages of MobileNet at local processing and transformer at global interaction.
  • And the bridge enables bidirectional fusion of local and global features.
  • Different from recent works on vision transformer, the transformer in Mobile-Former contains very few tokens (e.g.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 01, 2022
Journal 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Xiaoyi Dong
DOI 10.1109/cvpr52688.2022.00520
Citations 609
Source OpenAlex

More 🤖 Artificial Intelligence Research