Home / Research Library / Point-BERT: Pre-training 3D Point Cloud Transforme...
🤖 Artificial Intelligence OpenAlex

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

📅 June 1, 2022 👤 Xumin Yu, Lulu Tang, Yongming Rao et al. 📖 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 📊 704 citations

🤖 Plain-English Summary

We present Point-BERT, a new paradigm for learning Transformers to generalize the concept of BERT to 3D point cloud. We also demonstrate that the representations learned by Point-BERT transfer well to new tasks and domains, where our models largely advance the advanced of few-shot point cloud classification task.

🔑 Key Findings

  • Inspired by BERT, we devise a Masked Point Modeling (MPM) task to pre-train point cloud Transformers.
  • Specifically, we first divide a point cloud into several local point patches, and a point cloud Tokenizer with a discrete Variational AutoEncoder (dVAE) is designed to generate discrete point tokens containing meaningful local information.
  • Then, we randomly mask out some patches of input point clouds and feed them into the backbone Transformers.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 01, 2022
Journal 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors Xumin Yu, Lulu Tang, Yongming Rao, Tiejun Huang, Jie Zhou
DOI 10.1109/cvpr52688.2022.01871
Citations 704
Source OpenAlex

More 🤖 Artificial Intelligence Research