noahho / CAAFELinks

Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).

☆167

Alternatives and similar repositories for CAAFE

Users that are interested in CAAFE are comparing it to the libraries listed below

Sorting:

clinicalml / TabLLM
☆317Updated last year
microsoft / ticl
Tabular In-Context Learning
☆82Updated 5 months ago
Sungwon-Han / FeatLLM
☆36Updated last year
autogluon / autogluon-assistant
Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation
☆159Updated this week
tabularis-ai / be_great
A novel approach for synthesizing tabular data using pretrained large language models
☆317Updated last month
py-why / pywhyllm
Experimental library integrating LLM capabilities to support causal analyses
☆230Updated this week
naszilla / tabzilla
Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"
☆163Updated last year
csinva / imodelsX
Interpret text data using LLMs (scikit-learn compatible).
☆169Updated this week
autogluon / tabrepo
A Living Benchmark for Machine Learning on Tabular Data
☆108Updated this week
ZhangTP1996 / TapTap
☆67Updated 2 years ago
johnnyhwu / Awesome-LLM-Tabular
Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data
☆409Updated 7 months ago
ServiceNow / context-is-key-forecasting
Context is Key: A Benchmark for Forecasting with Essential Textual Information
☆70Updated this week
salesforce / causalai
Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data
☆294Updated 3 months ago
wwweiwei / awesome-self-supervised-learning-for-tabular-data
A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)
☆196Updated 5 months ago
yandex-research / tabred
(ICLR 2025 Spotlight) TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks
☆77Updated 2 months ago
PriorLabs / tabpfn-extensions
Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗
☆165Updated this week
PriorLabs / tabpfn-time-series
Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops)
☆244Updated this week
lanxiang1017 / Language-Modeling-on-Tabular-Data-Survey
A collection of AWESOME language modeling techniques on tabular data applications.
☆32Updated 9 months ago
WhatAShot / ExcelFormer
The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.
☆64Updated last year
mlfoundations / rtfm
Research on Tabular Foundation Models
☆54Updated 7 months ago
worldbank / REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
☆235Updated 3 weeks ago
PriorLabs / tabpfn-client
⚡ Easy API access to the tabular foundation model TabPFN ⚡
☆187Updated last week
soda-inria / tabicl
Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
☆150Updated 3 weeks ago
google-research / optformer
☆224Updated last month
datamllab / ltsm
LTSM-Bundle: A Toolbox and Benchmark on Large Language Models for Time Series Forecasting
☆98Updated 3 weeks ago
yandex-research / tabular-dl-tabr
The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"
☆300Updated 8 months ago
tanfiona / LLM-on-Tabular-Data-Prediction-Table-Understanding-Data-Generation
Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".
☆164Updated 9 months ago
UW-Madison-Lee-Lab / LanguageInterfacedFineTuning
Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.
☆129Updated 8 months ago
LeoGrin / tabular-benchmark
☆484Updated 11 months ago
puhsu / tabular-dl-pretrain-objectives
Revisiting Pretrarining Objectives for Tabular Deep Learning
☆65Updated 2 years ago