Accurate text classification and placement remain challenges in U.S. Gemini and Claude also demonstrated strong correlation with human ratings, with Claude achieving the highest Pearson scores (ρ = 0.75; 1-step, ρ = 0.73; 2-step) vs.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Jan 01, 2025 |
| Journal | Dagstuhl Research Online Publication Server |
| Authors | Da Corte, Miguel, Baptista, Jorge |
| DOI | 10.4230/oasics.slate.2025.1 |
| Citations | 739 |
| Source | OpenAlex |