Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆184 · Dec 20, 2024 · Updated last year
Alternatives and similar repositories for CAAFE
Users that are interested in CAAFE are comparing it to the libraries listed below.
- ☆44 · May 2, 2024 · Updated last year
- Tabular In-Context Learning ☆111 · Mar 6, 2025 · Updated last year
- Foundation Model for Tabular Data via reticulate ☆22 · Updated this week
- ⚡ Easy API access to the tabular foundation model TabPFN ⚡ ☆231 · Updated this week
- Neural Pipeline Search (NePS): Helps deep learning experts find the best neural pipeline. ☆78 · Updated this week
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks. ☆23 · Mar 31, 2025 · Updated 11 months ago
- ☆15 · May 26, 2022 · Updated 3 years ago
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops) ☆367 · Mar 12, 2026 · Updated last week
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search. ☆15 · Mar 22, 2023 · Updated 3 years ago
- a minimal website to get the diff of llm rewrites ☆11 · Dec 11, 2024 · Updated last year
- Amortized Inference for Causal Structure Learning, NeurIPS 2022 ☆73 · Feb 11, 2025 · Updated last year
- The PyExperimenter is a tool for the automatic execution of experiments, e.g. for machine learning (ML), capturing corresponding results … ☆39 · Updated this week
- Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?" ☆178 · Mar 22, 2024 · Updated 2 years ago
- Interpretable ML for TabPFN ☆47 · Jul 13, 2025 · Updated 8 months ago
- ☆20 · Jun 3, 2023 · Updated 2 years ago
- This work introduces LaT-PFN, a novel time series model that combines PFN and JEPA frameworks to generate zero-shot forecasts efficientl… ☆20 · Aug 1, 2024 · Updated last year
- Our maintained PFN repository. Come here to train SOTA PFNs. ☆136 · Jan 21, 2026 · Updated 2 months ago
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023) ☆17 · Oct 26, 2023 · Updated 2 years ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆111 · Aug 17, 2025 · Updated 7 months ago
- T-JEPA official repository ☆21 · Sep 28, 2024 · Updated last year
- ☆30 · Jan 28, 2025 · Updated last year
- ☆16 · Nov 25, 2022 · Updated 3 years ago
- TabICLv2: A state-of-the-art tabular foundation model ☆638 · Updated this week
- ☆331 · Jun 19, 2024 · Updated last year
- ☆45 · Aug 2, 2024 · Updated last year
- ☆18 · Jan 23, 2023 · Updated 3 years ago
- OpenFE: automated feature generation with expert-level performance ☆869 · May 27, 2024 · Updated last year
- ☆22 · Oct 30, 2024 · Updated last year
- ☆31 · Jun 24, 2024 · Updated last year
- In-context Bayesian Optimization ☆17 · Feb 20, 2026 · Updated last month
- A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks. ☆19 · Mar 16, 2026 · Updated last week
- sktime - python toolbox for time series: pipelines and transformers ☆25 · Dec 1, 2022 · Updated 3 years ago
- AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D. ☆1,162 · Feb 12, 2026 · Updated last month
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science ☆35 · Oct 25, 2024 · Updated last year
- Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation ☆259 · Jan 30, 2026 · Updated last month
- [NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets ☆89 · Feb 28, 2023 · Updated 3 years ago
- A benchmark of meaningful graph datasets with tabular node features ☆14 · Oct 29, 2025 · Updated 4 months ago
- Drift-Resilient TabPFN is a method using In-Context Learning via a Prior-Data Fitted Network, to address temporal distribution shifts in … ☆28 · May 17, 2025 · Updated 10 months ago
- Performant, composable online learning ☆16 · Feb 22, 2021 · Updated 5 years ago