Home / Research Library / TrackFormer: Multi-Object Tracking with Transforme...
🤖 Artificial Intelligence OpenAlex

TrackFormer: Multi-Object Tracking with Transformers

📅 June 1, 2022 👤 Tim Meinhardt, Alexander Kirillov, Laura Leal-Taixé et al. 📖 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 📊 942 citations

🤖 Plain-English Summary

The challenging task of multi-object tracking (MOT) requires simultaneous reasoning about track initialization, identity, and spatio-temporal trajectories. TrackFormer introduces a new tracking-by-attention paradigm and while simple in its design is able to achieve advanced performance on the task of multi-object tracking (MOT17) and segmentation (MOTS20).

🔑 Key Findings

  • We formulate this task as a frame-to-frame set prediction problem and introduce TrackFormer, an end-to-end trainable MOT approach based on an encoder-decoder Transformer architecture.
  • Our model achieves data association between frames via attention by evolving a set of track predictions through a video sequence.
  • The Transformer decoder initializes new tracks from static object queries and autoregressively follows existing tracks in space and time with the conceptually new and identity preserving track queries.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 01, 2022
Journal 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors Tim Meinhardt, Alexander Kirillov, Laura Leal-Taixé, Christoph Feichtenhofer
DOI 10.1109/cvpr52688.2022.00864
Citations 942
Source OpenAlex

More 🤖 Artificial Intelligence Research