noahho / CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆151Updated 2 months ago
Alternatives and similar repositories for CAAFE:
Users that are interested in CAAFE are comparing it to the libraries listed below
- ☆25Updated 10 months ago
- Experimental library integrating LLM capabilities to support causal analyses☆110Updated 5 months ago
- ML Assistant for Competitive Machine Learning☆110Updated last week
- Tabular In-Context Learning☆47Updated this week
- ☆290Updated last year
- ☆144Updated 11 months ago
- Compare and ensemble models without retraining☆47Updated this week
- ☆65Updated last year
- Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data☆275Updated last year
- The collection of resources about LLM for Time series tasks☆134Updated 8 months ago
- ML models + benchmark for tabular data classification and regression☆76Updated 3 weeks ago
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆283Updated 3 months ago
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".☆115Updated 4 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆184Updated 2 weeks ago
- A novel approach for synthesizing tabular data using pretrained large language models☆299Updated 4 months ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆28Updated 4 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆123Updated 3 months ago
- [ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation☆47Updated this week
- Understanding Different Design Choices in Training Large Time Series Models☆87Updated this week
- Context is Key: A Benchmark for Forecasting with Essential Textual Information☆53Updated 3 weeks ago
- TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks☆62Updated 3 months ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆369Updated 2 months ago
- ☆68Updated 2 months ago
- A Natural Language Interface to Explainable Boosting Machines☆65Updated 7 months ago
- Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗☆73Updated this week
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆58Updated 8 months ago
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24☆157Updated 3 months ago
- Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.☆163Updated this week
- ☆275Updated 8 months ago