Established Machine Learning Matches Tabular Foundation Models in Clinical Predictions

Shaktah, L. A.; Gustav, M.; Lenz, T.; Liang, J.; Hilgers, L.; Carrero, Z. I.; Kather, J. N.

2026-02-04 health informatics

10.64898/2026.02.02.26345274 medRxiv


Foundation models (FMs) promise to standardize predictive modeling across domains, yet their clinical value for tabular data remains unproven. To test this, we performed a large, fully reproducible benchmark of TabPFN, a leading FM for tabular prediction, against twelve established machine learning (ML) methods across twelve binary clinical tasks. Cohorts ranged from 788 to 139,528 patients and covered diverse outcomes, including survival, metastasis, and disease status. Under standardized preprocessing, bootstrapping, and multiple performance metrics, TabPFN was generally competitive but did not consistently outperform strong ML baselines. It exceeded the best ML model in only 16.7% of tasks, with most differences in area under the receiver operating characteristic curve (AUROC) within ±0.01. TabPFN also incurred higher computational cost, with median runtimes 5.5x longer and practical reliance on GPU acceleration. These findings indicate that, for routine clinical tabular prediction, TabPFN offers limited performance gains relative to optimized ML methods, while introducing significant efficiency trade-offs.
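The abstract compares models by AUROC under bootstrapping but does not describe the exact procedure. As a rough illustration only, the sketch below shows one common way to do this: a paired percentile bootstrap of the AUROC difference between two models scored on the same patients. The function names (`auroc`, `bootstrap_auroc_diff`), the 1,000 resamples, and the 95% percentile interval are assumptions made for this example, not details taken from the study.

```python
import random


def auroc(y_true, scores):
    """AUROC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auroc_diff(y_true, scores_a, scores_b, n_boot=1000, seed=0):
    """Return (observed difference, lower, upper) for AUROC(A) - AUROC(B).

    Hypothetical helper: a paired percentile bootstrap with a 95% interval.
    """
    # Point estimate on the full sample (also rejects single-class input early).
    observed = auroc(y_true, scores_a) - auroc(y_true, scores_b)
    rng = random.Random(seed)
    n = len(y_true)
    diffs = []
    while len(diffs) < n_boot:
        # Resample patients with replacement; both models share the same indices.
        idx = [rng.randrange(n) for _ in range(n)]
        y = [y_true[i] for i in idx]
        if len(set(y)) < 2:
            continue  # skip resamples that contain only one class
        a = [scores_a[i] for i in idx]
        b = [scores_b[i] for i in idx]
        diffs.append(auroc(y, a) - auroc(y, b))
    diffs.sort()
    lower = diffs[int(0.025 * n_boot)]
    upper = diffs[int(0.975 * n_boot) - 1]
    return observed, lower, upper
```

Resampling patients once per iteration and scoring both models on the same resample keeps the comparison paired, which matters when the differences of interest are as small as ±0.01. The pairwise AUROC loop is quadratic in cohort size, so for cohorts on the scale reported here a rank-based implementation would be the practical choice.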

Matching journals

● Non-profit ◐ University press ○ Commercial

The top 7 journals account for 50% of the predicted probability mass.


npj Digital Medicine

○ 97 papers in training set

Nature Biomedical Engineering

○ 42 papers in training set

Nature Communications

○ 4913 papers in training set

Nature Machine Intelligence

○ 61 papers in training set

Scientific Reports

○ 3102 papers in training set

JCO Clinical Cancer Informatics

● 18 papers in training set

○ 70 papers in training set

50% of probability mass above

Cell Reports Medicine

○ 140 papers in training set

Nature Medicine

○ 117 papers in training set

Communications Medicine

○ 85 papers in training set

● 4510 papers in training set

Briefings in Bioinformatics

◐ 326 papers in training set

◐ 1061 papers in training set

The Lancet Digital Health

○ 25 papers in training set

○ 130 papers in training set

Communications Biology

○ 886 papers in training set

Journal of Medical Internet Research

◐ 85 papers in training set

Nature Computational Science

○ 50 papers in training set

Medical Image Analysis

○ 33 papers in training set

Advanced Science

○ 249 papers in training set

◐ 147 papers in training set

Science Translational Medicine

● 111 papers in training set

○ 38 papers in training set

PLOS Digital Health

● 91 papers in training set

● 5422 papers in training set

IEEE Journal of Biomedical and Health Informatics

● 34 papers in training set

JMIR Medical Informatics

◐ 17 papers in training set

European Respiratory Journal

● 54 papers in training set

Annals of Internal Medicine

● 27 papers in training set

Journal of the American Medical Informatics Association

◐ 61 papers in training set