Home / Research Articles Hub / UNETR: Transformers for 3D Medical Image Segmentat...
🤖 Artificial Intelligence OpenAlex

UNETR: Transformers for 3D Medical Image Segmentation

📅 Published: January 1, 2022 👤 Ali Hatamizadeh, Yucheng Tang, Vishwesh Nath et al. 📖 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 📊 2,838 citations
AI-Generated Summary

Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have shown prominence for the majority of medical image segmentation applications since the past decade. We have validated the performance of our method on the Multi Atlas Labeling Beyond The Cranial Vault (BTCV) dataset for multi-organ segmentation and the Medical Segmentation Decathlon (MSD) dataset for brain tumor and spleen segmentation tasks.

⚡ This is an original paraphrased summary — not copied from the abstract. Full paper available at the source link below.

Key Findings
  • 1 In FCNNs, the encoder plays an integral role by learning both global and local features and contextual representations which can be utilized for semantic output prediction by the decoder.
  • 2 Despite their success, the locality of convolutional layers in FCNNs, limits the capability of learning long-range spatial dependencies.
  • 3 Inspired by the recent success of transformers for Natural Language Processing (NLP) in long-range sequence learning, we reformulate the task of volumetric (3D) medical image segmentation as a sequence-to-sequence prediction problem.
Why It Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

This summary is based on publicly available metadata and abstract. For the full research paper, visit the original source:

Read Full Paper at OpenAlex
More Artificial Intelligence Papers ← Back to Hub 📚 Learning Hub
Article Details
Source OpenAlex
Category 🤖 Artificial Intelligence
Published Jan 1, 2022
Journal 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
DOI 10.1109/wacv51458.2022.00181
Citations 2,838
Authors Ali Hatamizadeh, Yucheng Tang, Vishwesh Nath, Dong Yang, Andriy Myronenko