Home / Research Articles Hub / ProteinBERT: a universal deep-learning model of pr...
🤖 Artificial Intelligence OpenAlex

ProteinBERT: a universal deep-learning model of protein sequence and function

📅 Published: January 8, 2022 👤 Nadav Brandes, Dan Ofer, Yam Peleg et al. 📖 Bioinformatics 📊 967 citations
AI-Generated Summary

SUMMARY: Self-supervised deep language modeling has shown unprecedented success across natural language tasks, and has recently been repurposed to biological sequences. AVAILABILITY AND IMPLEMENTATION: Code and pretrained model weights are available at https://github.com/nadavbra/protein_bert.

⚡ This is an original paraphrased summary — not copied from the abstract. Full paper available at the source link below.

Key Findings
  • 1 However, existing models and pretraining methods are designed and optimized for text analysis.
  • 2 We introduce ProteinBERT, a deep language model specifically designed for proteins.
  • 3 Our pretraining scheme combines language modeling with a novel task of Gene Ontology (GO) annotation prediction.
Why It Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

This summary is based on publicly available metadata and abstract. For the full research paper, visit the original source:

Read Full Paper at OpenAlex
More Artificial Intelligence Papers ← Back to Hub 📚 Learning Hub
Article Details
Source OpenAlex
Category 🤖 Artificial Intelligence
Published Jan 8, 2022
Journal Bioinformatics
DOI 10.1093/bioinformatics/btac020
Citations 967
Authors Nadav Brandes, Dan Ofer, Yam Peleg, Nadav Rappoport, Michal Linial