Almanac
model

Tabular Foundation Models (TFMs)

modelactivetabular-foundation-models-tfms--78303197·3 events·first seen 29d ago

Aliases: Tabular Foundation Models (TFMs), Tabular Foundation Models, tabular foundation models

Co-occurring entities

More like this (12)

Recent events (3)

5arXiv · cs.AI·29d ago·source ↗

Distilling Tabular Foundation Models for Structured Health Data

This paper investigates knowledge distillation from tabular foundation models (TFMs) to lightweight student models for healthcare applications. The authors address context leakage in in-context TFMs via stratified out-of-fold teacher labeling, evaluating across 19 healthcare datasets, 6 TFM teachers, and 4 student families. Distilled students retain at least 90% of teacher AUC while running 26× faster on CPU, with preserved calibration and fairness properties. Multi-teacher ensembles do not consistently outperform the best single teacher.

5arXiv · cs.AI·29d ago·source ↗

Ensembling Tabular Foundation Models: A Diversity Ceiling and a Calibration Trap

This paper benchmarks six ensemble strategies across six tabular foundation models (TFMs) on 153 OpenML classification tasks, finding that ensembling provides minimal gains over the best single TFM. The best ensemble strategy (two-level cascade stacking) achieves only +0.18% accuracy improvement at 253× the compute cost. A key finding is that logistic-regression meta-learner stacking improves accuracy while severely degrading calibration (log-loss), because sharpening class boundaries destroys probability estimates. The authors recommend greedy ensemble selection as the practical default.

5arXiv · cs.AI·22d ago·source ↗

WSADBench: A Unified Benchmark for Weakly Supervised Anomaly Detection

WSADBench is the first benchmark to unify evaluation across the three primary weakly supervised anomaly detection (WSAD) paradigms—incomplete, inexact, and inaccurate supervision—testing 36 algorithms across 4 modalities with over 700K experiments. Key findings challenge the isolation of current WSAD research directions, showing strong correlations between supervision scenarios and that specialized WSAD methods are quickly outperformed by tabular foundation models as label availability increases. The benchmark also reveals inconsistent utility of unlabeled data and asymmetric model sensitivity to label noise types. Code and datasets are released open-source.