Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. Our results, which we have made publicly available as competitive benchmarks, indicate that algorithms based on gradient-boosted tree ensembles still mostly outperform deep learning models on supervised learning tasks, suggesting that the research progress on competitive deep learning models for tabular data is stagnating.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Dec 23, 2022 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| Authors | Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk |
| DOI | 10.1109/tnnls.2022.3229161 |
| Citations | 844 |
| Source | OpenAlex |