Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆192 · Dec 20, 2024 · Updated last year
Alternatives and similar repositories for CAAFE
Users that are interested in CAAFE are comparing it to the libraries listed below.
- Official implementation of Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning (NeurIPS 2024). ☆34 · Mar 4, 2025 · Updated last year
- Tabular In-Context Learning ☆116 · Mar 6, 2025 · Updated last year
- ⚡ Easy API access to the tabular foundation model TabPFN ⚡ ☆246 · Jun 24, 2026 · Updated last week
- Neural Pipeline Search (NePS): Helps deep learning experts find the best neural pipeline. ☆80 · Jun 22, 2026 · Updated last week
- Code accompanying https://arxiv.org/abs/1802.02219 ☆20 · Oct 5, 2022 · Updated 3 years ago
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks. ☆24 · Mar 31, 2025 · Updated last year
- Official implementation of "TabEBM: A Tabular Data Augmentation Method with Class-Specific Energy-Based Models", NeurIPS 2024 ☆25 · Aug 19, 2025 · Updated 10 months ago
- ☆15 · May 26, 2022 · Updated 4 years ago
- TabPFGen: Synthetic Tabular Data Generation with TabPFN ☆42 · Jul 15, 2025 · Updated 11 months ago
- An ensemble-based, size-agnostic wrapper around the TabPFN classifier for chemical datasets. ☆35 · May 18, 2024 · Updated 2 years ago
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops) ☆427 · Jun 17, 2026 · Updated 2 weeks ago
- a minimal website to get the diff of llm rewrites ☆11 · Dec 11, 2024 · Updated last year
- Amortized Inference for Causal Structure Learning, NeurIPS 2022 ☆79 · Feb 11, 2025 · Updated last year
- Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?" ☆182 · Mar 22, 2024 · Updated 2 years ago
- Interpretable ML for TabPFN ☆53 · Jul 13, 2025 · Updated 11 months ago
- This work introduces LaT-PFN, a novel time series model that combines PFN and JEPA frameworks to generate zero-shot forecasts efficientl… ☆22 · Aug 1, 2024 · Updated last year
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡ ☆7,435 · Updated this week
- Our maintained PFN repository. Come here to train SOTA PFNs. ☆148 · Jan 21, 2026 · Updated 5 months ago
- A learning curve benchmark on OpenML data ☆34 · Nov 29, 2024 · Updated last year
- ☆16 · Nov 25, 2022 · Updated 3 years ago
- ☆39 · May 6, 2026 · Updated last month
- A rule-based approach to explain the output of any machine learning model ☆17 · Apr 4, 2024 · Updated 2 years ago
- ☆343 · Jun 19, 2024 · Updated 2 years ago
- ☆21 · Jan 13, 2022 · Updated 4 years ago
- ☆46 · Aug 2, 2024 · Updated last year
- TabICLv2: A state-of-the-art tabular foundation model ☆1,050 · Jun 8, 2026 · Updated 3 weeks ago
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24 ☆234 · Dec 3, 2024 · Updated last year
- ☆23 · Oct 30, 2024 · Updated last year
- ☆32 · Jun 24, 2024 · Updated 2 years ago
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models" ☆48 · Dec 29, 2023 · Updated 2 years ago
- A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks. ☆20 · Updated this week
- Computing the gap statistics from Tibshirani et al. for various clustering algorithms ☆14 · Nov 10, 2025 · Updated 7 months ago
- [NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benc… ☆20 · Nov 12, 2023 · Updated 2 years ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science ☆35 · Oct 25, 2024 · Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 · Oct 27, 2023 · Updated 2 years ago
- A Living Benchmark for Machine Learning on Tabular Data ☆247 · Updated this week
- AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D. ☆1,332 · May 2, 2026 · Updated 2 months ago
- [NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets ☆89 · Feb 28, 2023 · Updated 3 years ago
- Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation ☆289 · Mar 20, 2026 · Updated 3 months ago