Home / Research Library / YOLOv10: Real-Time End-to-End Object Detection
🤖 Artificial Intelligence OpenAlex

YOLOv10: Real-Time End-to-End Object Detection

📅 May 23, 2024 👤 Ao Wang, Hui Chen, Lihao Liu et al. 📖 arXiv (Cornell University) 📊 1,032 citations

🤖 Plain-English Summary

Over the past years, YOLOs have emerged as the predominant paradigm in the field of real-time object detection owing to their effective balance between computational cost and detection performance. For example, our YOLOv10-S is 1.8$\times$ faster than RT-DETR-R18 under the similar AP on COCO, meanwhile enjoying 2.8$\times$ smaller number of parameters and FLOPs.

🔑 Key Findings

  • Researchers have explored the architectural designs, optimization objectives, data augmentation strategies, and others for YOLOs, achieving notable progress.
  • However, the reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs and adversely impacts the inference latency.
  • Besides, the design of various components in YOLOs lacks the comprehensive and thorough inspection, resulting in noticeable computational redundancy and limiting the model's capability.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published May 23, 2024
Journal arXiv (Cornell University)
Authors Ao Wang, Hui Chen, Lihao Liu, Kai Chen, Zijia Lin
DOI 10.48550/arxiv.2405.14458
Citations 1,032
Source OpenAlex

More 🤖 Artificial Intelligence Research