We evaluated the performance of ChatGPT, a large language model, on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT's explanations showed a high level of concordance and insight.
This research advances how AI systems learn, reason, and solve problems, with direct implications for automation and scientific discovery.
| Field | Value |
| --- | --- |
| Category | 🤖 Artificial Intelligence |
| Published | Feb 09, 2023 |
| Journal | PLOS Digital Health |
| Authors | Tiffany H. Kung, Morgan Cheatham, Arielle Medenilla, Czarina Sillos, Lorie De Leon |
| DOI | 10.1371/journal.pdig.0000198 |
| Citations | 3,534 |
| Source | OpenAlex |