Home / Research Library / ProteinBERT: a universal deep-learning model of pr...
🤖 Artificial Intelligence OpenAlex

ProteinBERT: a universal deep-learning model of protein sequence and function

📅 January 8, 2022 👤 Nadav Brandes, Dan Ofer, Yam Peleg et al. 📖 Bioinformatics 📊 967 citations

🤖 Plain-English Summary

SUMMARY: Self-supervised deep language modeling has shown unprecedented success across natural language tasks, and has recently been repurposed to biological sequences. AVAILABILITY AND IMPLEMENTATION: Code and pretrained model weights are available at https://github.com/nadavbra/protein_bert.

🔑 Key Findings

  • However, existing models and pretraining methods are designed and optimized for text analysis.
  • We introduce ProteinBERT, a deep language model specifically designed for proteins.
  • Our pretraining scheme combines language modeling with a novel task of Gene Ontology (GO) annotation prediction.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jan 08, 2022
Journal Bioinformatics
Authors Nadav Brandes, Dan Ofer, Yam Peleg, Nadav Rappoport, Michal Linial
DOI 10.1093/bioinformatics/btac020
Citations 967
Source OpenAlex

More 🤖 Artificial Intelligence Research