The conserved domain database in 2023

📅 December 8, 2022 👤 Jiyao Wang, Farideh Chitsaz, Myra K. Derbyshire et al. 📖 Nucleic Acids Research 📊 1,171 citations

🤖 Plain-English Summary

NLM's conserved domain database (CDD) is a collection of protein domain and protein family models constructed as multiple sequence alignments. CDD curation staff builds hierarchical classifications of large protein domain families, adds models for novel domain families via surveillance of the protein 'dark matter' that currently lacks annotation, and now spends considerable effort on providing names and attribution for conserved domain architectures.

🔑 Key Findings

  • CDD's main purpose is to annotate protein and translated nucleotide sequences with the location of domain footprints and associated functional sites, and to define protein domain architecture as a basis for assigning gene product names and putative/predicted function.
  • CDD has been available publicly for over 20 years and has grown substantially during that time.
  • Maintaining an archive of pre-computed annotation continues to be a challenge and has slowed down the cadence of CDD releases.
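
To make the idea of a domain architecture concrete, here is a minimal Python sketch. It assumes an architecture can be treated as the list of domain names ordered by where their footprints fall on the sequence. The `DomainHit` class, the `domain_architecture` function, and the example names and coordinates are all hypothetical illustrations; they are not CDD's data model, API, or real accessions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One domain footprint on a protein sequence (hypothetical structure)."""
    model: str  # illustrative model name, not a real CDD accession
    start: int  # first residue covered by the footprint (1-based, inclusive)
    end: int    # last residue covered by the footprint (1-based, inclusive)


def domain_architecture(hits):
    """Return domain names ordered by footprint position along the sequence."""
    ordered = sorted(hits, key=lambda h: (h.start, h.end))
    return [h.model for h in ordered]


# Made-up hits, listed out of order on purpose.
hits = [
    DomainHit("domain_B", 210, 340),
    DomainHit("domain_A", 15, 180),
]

architecture = domain_architecture(hits)
print(" - ".join(architecture))  # prints "domain_A - domain_B"
```

In this simplified view, two proteins share an architecture when their ordered lists match, which is the kind of grouping a shared name could then be attached to.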

💡 Why This Matters

CDD annotation marks domain footprints and functional sites on protein sequences, which supports assigning gene product names and predicted functions, including for proteins that currently lack annotation.

📋 Article Details

Category ∑ Mathematics
Published Dec 08, 2022
Journal Nucleic Acids Research
Authors Jiyao Wang, Farideh Chitsaz, Myra K. Derbyshire, Noreen R. Gonzales, Marc Gwadz
DOI 10.1093/nar/gkac1096
Citations 1,171
Source OpenAlex
