noahho / CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆154Updated 3 months ago
Alternatives and similar repositories for CAAFE:
Users that are interested in CAAFE are comparing it to the libraries listed below
- ☆28Updated 11 months ago
- ☆293Updated last year
- Tabular In-Context Learning☆52Updated 3 weeks ago
- ☆64Updated last year
- Compare and ensemble models without retraining☆50Updated this week
- ML Assistant for Competitive Machine Learning☆112Updated last month
- ☆147Updated last year
- Context is Key: A Benchmark for Forecasting with Essential Textual Information☆58Updated last month
- A collection of AWESOME language modeling techniques on tabular data applications.☆29Updated 5 months ago
- Experimental library integrating LLM capabilities to support causal analyses☆120Updated this week
- ☆50Updated 7 months ago
- Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗☆91Updated this week
- A novel approach for synthesizing tabular data using pretrained large language models☆305Updated 5 months ago
- ☆284Updated 9 months ago
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops)☆149Updated this week
- Interpret text data using LLMs (scikit-learn compatible).☆163Updated 2 weeks ago
- ML models + benchmark for tabular data classification and regression☆114Updated this week
- [ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation☆61Updated 2 weeks ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆377Updated 3 months ago
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".☆130Updated 5 months ago
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.☆18Updated this week
- Implementation of the paper: WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engin…☆83Updated 11 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆184Updated last month
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24☆171Updated 4 months ago
- ☆73Updated last year
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆60Updated 9 months ago
- Understanding Different Design Choices in Training Large Time Series Models☆89Updated 3 weeks ago
- NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables☆186Updated 3 weeks ago
- Code for finetuning TabPFN on one downstream tabular dataset.☆34Updated last week
- ☆205Updated 3 months ago