Entity · model

Tabular Foundation Models (TFMs)

model · active · tabular-foundation-models-tfms--78303197 · 3 events · first seen May 19, 2026

Aliases: Tabular Foundation Models (TFMs), Tabular Foundation Models, tabular foundation models

Co-occurring entities

WSADBench · weakly supervised anomaly detection · SUFE-AILAB · Q-statistic · Greedy Ensemble Selection · Friedman-Nemenyi Test · OpenML · Cascade Stacking · knowledge distillation · Stratified Out-of-Fold Teacher Labeling · AUC

More like this (12)

foundation models · Visual Geometry Foundation Models · Empirical Evaluation of Out-Of-Distribution Performance of Tabular Foundation Models · Time Series Foundation Models · single-cell foundation model · Normal Technology Framework · Fields Model Initiative · Apple Foundation Models Framework · KnowsTFM · TabFM · TabFM · DSR Foundation Model

Recent events (3)

5 · arXiv · cs.AI · May 26, 2026

WSADBench: A Unified Benchmark for Weakly Supervised Anomaly Detection

WSADBench is the first benchmark to unify evaluation across the three primary weakly supervised anomaly detection (WSAD) paradigms (incomplete, inexact, and inaccurate supervision), testing 36 algorithms across 4 modalities in over 700K experiments. Its key findings challenge the isolation of current WSAD research directions: there are strong correlations between supervision scenarios, and specialized WSAD methods are quickly outperformed by tabular foundation models as label availability increases. The benchmark also reveals that unlabeled data is not consistently useful and that models are asymmetrically sensitive to different types of label noise. Code and datasets are released as open source.

Evaluation and Benchmarking · WSADBench · weakly supervised anomaly detection · SUFE-AILAB · +1 more

5 · arXiv · cs.AI · May 19, 2026

Ensembling Tabular Foundation Models: A Diversity Ceiling and a Calibration Trap

This paper benchmarks six ensemble strategies across six tabular foundation models (TFMs) on 153 OpenML classification tasks and finds that ensembling provides minimal gains over the best single TFM. The best ensemble strategy (two-level cascade stacking) improves accuracy by only 0.18% at 253× the compute cost. A key finding is that stacking with a logistic-regression meta-learner improves accuracy while severely degrading calibration (log-loss), because sharpening class boundaries destroys the probability estimates. The authors recommend greedy ensemble selection as the practical default.

Evaluation and Benchmarking · Enterprise Deployment Patterns · Q-statistic · Greedy Ensemble Selection · Friedman-Nemenyi Test · +3 more
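
For readers unfamiliar with greedy ensemble selection, the idea can be sketched as forward selection with replacement over held-out predictions: start empty, repeatedly add whichever base model most improves the validation score of the averaged prediction, and use the selection counts as weights. The sketch below is illustrative only and is not taken from the paper. It assumes log-loss as the selection metric, and the names `greedy_ensemble_selection`, `toy_model`, and the synthetic predictions are hypothetical stand-ins for real TFM outputs.

```python
import math
import random


def log_loss(y_true, probas, eps=1e-12):
    """Mean negative log-probability assigned to the true class."""
    return -sum(math.log(max(p[y], eps)) for y, p in zip(y_true, probas)) / len(y_true)


def average(prob_tables):
    """Element-wise mean of several (n_samples x n_classes) prediction tables."""
    return [[sum(col) / len(col) for col in zip(*rows)] for rows in zip(*prob_tables)]


def greedy_ensemble_selection(val_probas, y_val, n_rounds=10):
    """Forward selection with replacement; returns one weight per base model."""
    chosen, best_loss, best_chosen = [], math.inf, []
    for _ in range(n_rounds):
        # Score the current bag plus each candidate model in turn.
        losses = [
            log_loss(y_val, average([val_probas[i] for i in chosen] + [p]))
            for p in val_probas
        ]
        pick = min(range(len(losses)), key=losses.__getitem__)
        chosen.append(pick)
        # Remember the best bag seen so far (assumed metric: validation log-loss).
        if losses[pick] < best_loss:
            best_loss, best_chosen = losses[pick], list(chosen)
    return [best_chosen.count(i) / len(best_chosen) for i in range(len(val_probas))]


# Synthetic validation set and three hypothetical base models of varying quality.
random.seed(0)
y_val = [random.randint(0, 1) for _ in range(200)]


def toy_model(signal):
    """Hypothetical model: P(class 1) leans toward the true label by `signal`, plus noise."""
    out = []
    for y in y_val:
        p1 = 0.5 + signal * (y - 0.5) + random.gauss(0, 0.2)
        p1 = min(max(p1, 0.01), 0.99)
        out.append([1 - p1, p1])
    return out


val_probas = [toy_model(0.6), toy_model(0.4), toy_model(0.0)]
weights = greedy_ensemble_selection(val_probas, y_val)
print(weights)
```

Because the first round always picks the best single model, the selected blend's validation log-loss is never worse than that model's. Selecting on a probability-aware metric such as log-loss, rather than accuracy, is one way to keep calibration in view, though whether the paper does so is not stated in this summary.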

5 · arXiv · cs.AI · May 19, 2026

Distilling Tabular Foundation Models for Structured Health Data

This paper investigates knowledge distillation from tabular foundation models (TFMs) to lightweight student models for healthcare applications. The authors address context leakage in in-context TFMs via stratified out-of-fold teacher labeling, evaluating across 19 healthcare datasets, 6 TFM teachers, and 4 student families. Distilled students retain at least 90% of teacher AUC while running 26× faster on CPU, with preserved calibration and fairness properties. Multi-teacher ensembles do not consistently outperform the best single teacher.

Evaluation and Benchmarking · Inference Economics · knowledge distillation · Stratified Out-of-Fold Teacher Labeling · AUC · +2 more
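
The context-leakage problem and the out-of-fold remedy can be illustrated with a small sketch. An in-context teacher that holds a row (and its label) in its context can simply echo that label back, so the "soft" labels it produces carry no extra information for the student. Labeling each row with a teacher whose context excludes that row's fold avoids this. The code below is a minimal sketch under assumptions, not the paper's implementation: a nearest-neighbour lookup stands in for an in-context TFM, and `stratified_folds`, `knn_teacher`, and `out_of_fold_teacher_labels` are hypothetical names.

```python
import random
from collections import defaultdict


def stratified_folds(y, n_folds, seed=0):
    """Assign each row a fold id so every class is spread evenly over the folds."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, label in enumerate(y):
        by_class[label].append(i)
    fold_of = [0] * len(y)
    for rows in by_class.values():
        rng.shuffle(rows)
        for k, i in enumerate(rows):
            fold_of[i] = k % n_folds
    return fold_of


def knn_teacher(context_X, context_y, query, k=1):
    """Stand-in 'in-context' teacher: P(class 1) from the k nearest context rows."""
    nearest = sorted(
        range(len(context_X)),
        key=lambda j: sum((a - b) ** 2 for a, b in zip(context_X[j], query)),
    )[:k]
    return sum(context_y[j] for j in nearest) / len(nearest)


def out_of_fold_teacher_labels(X, y, teacher, n_folds=5):
    """Soft label per row from a teacher whose context excludes that row's fold."""
    fold_of = stratified_folds(y, n_folds)
    soft = [0.0] * len(y)
    for fold in range(n_folds):
        ctx = [i for i in range(len(y)) if fold_of[i] != fold]
        ctx_X, ctx_y = [X[i] for i in ctx], [y[i] for i in ctx]
        for i in range(len(y)):
            if fold_of[i] == fold:
                soft[i] = teacher(ctx_X, ctx_y, X[i])
    return soft


# Synthetic binary task with noisy labels (illustrative data only).
rng = random.Random(1)
X = [[rng.gauss(0, 1)] for _ in range(120)]
y = [1 if x[0] + rng.gauss(0, 1) > 0 else 0 for x in X]

# Leaky labeling: every row is in the teacher's own context, so it finds itself.
leaky = [knn_teacher(X, y, x) for x in X]
# Out-of-fold labeling: the row being labeled is never in the context.
oof = out_of_fold_teacher_labels(X, y, knn_teacher)

print("leaky labels identical to ground truth:", leaky == y)
print("rows where out-of-fold label differs:", sum(a != b for a, b in zip(oof, y)))
```

A student model would then be fit on `X` with `oof` as targets. Stratifying the folds keeps the class ratio in each teacher context close to the overall ratio, which matters for the imbalanced outcomes common in health data.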