Home / Research Articles Hub / DINO: DETR with Improved DeNoising Anchor Boxes fo...
🤖 Artificial Intelligence OpenAlex

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

📅 Published: March 7, 2022 👤 Hao Zhang, Feng Li, Shilong Liu et al. 📖 arXiv (Cornell University) 📊 761 citations
AI-Generated Summary

We present DINO (\textbf{D}ETR with \textbf{I}mproved de\textbf{N}oising anch\textbf{O}r boxes), a advanced end-to-end object detector. Compared to other models on the leaderboard, DINO significantly reduces its model size and pre-training data size while achieving better results.

⚡ This is an original paraphrased summary — not copied from the abstract. Full paper available at the source link below.

Key Findings
  • 1 DINO improves over previous DETR-like models in performance and efficiency by using a contrastive way for denoising training, a mixed query selection method for anchor initialization, and a look forward twice scheme for box prediction.
  • 2 DINO achieves $49.4$AP in $12$ epochs and $51.3$AP in $24$ epochs on COCO with a ResNet-50 backbone and multi-scale features, yielding a significant improvement of $\textbf{+6.0}$\textbf{AP} and $\textbf{+2.7}$\textbf{AP}, respectively, compared to DN-DETR, the previous best DETR-like model.
  • 3 DINO scales well in both model size and data size.
Why It Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

This summary is based on publicly available metadata and abstract. For the full research paper, visit the original source:

Read Full Paper at OpenAlex
More Artificial Intelligence Papers ← Back to Hub 📚 Learning Hub
Article Details
Source OpenAlex
Category 🤖 Artificial Intelligence
Published Mar 7, 2022
Journal arXiv (Cornell University)
DOI 10.48550/arxiv.2203.03605
Citations 761
Authors Hao Zhang, Feng Li, Shilong Liu, Lei Zhang, Hang Su