Pretraining large neural language models, such as BERT, has led to impressive gains on many natural language processing (NLP) tasks. Further, in conducting a thorough evaluation of modeling choices, both for pretraining and task-specific fine-tuning, we discover that some common practices are unnecessary with BERT models, such as using complex tagging schemes in named entity recognition.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Oct 15, 2021 |
| Journal | ACM Transactions on Computing for Healthcare |
| Authors | 裕二 池谷, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama |
| DOI | 10.1145/3458754 |
| Citations | 2,000 |
| Source | OpenAlex |