MultiTabPFN: Codebook-based extensions of TabPFN for high-class-count tabular classification

Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Tabular data are among the most common data types, and TabPFN has recently emerged as a powerful foundation model offering fast, training-free predictions. However, its applicability to high-class-count classification remains limited, as fine-tuning or retraining incurs heavy computational costs. We address this gap by framing multiclass prediction within the Error-Correcting Output Codes (ECOC) paradigm, a training-free approach whose effectiveness depends critically on codebook design and decoding. We present the first systematic study of ECOC-based extensions for TabPFN and introduce MultiTabPFN, a modular framework with Classwise Principal components-based Indexing (CPI)-a novel codebook method that encodes class-level geometry into compact binary codes. Compared to the conventional ECOC constructions, CPI explicitly balances separability and redundancy in the code space, thereby providing a principled path for scaling tabular foundation models to many-class settings. Combined with confidence-aware decoding, MultiTabPFN consistently outperforms standard ECOC baselines across synthetic tasks and 36 real-world benchmarks, establishing a practical and training-free extension of TabPFN to high-class-count tabular classification. © 2026 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.

키워드

Codebook generationMulticlass classificationTabular foundation modelTabular learning
제목
MultiTabPFN: Codebook-based extensions of TabPFN for high-class-count tabular classification
저자
Lee, Kyungeun
DOI
10.1016/j.neunet.2026.108932
발행일
2026-10
유형
Article
저널명
Neural Networks
202