Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆186 · Dec 20, 2024 · Updated last year
Alternatives and similar repositories for CAAFE
Users who are interested in CAAFE often compare it to the libraries listed below.
- Official implementation of Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning (NeurIPS 2024). ☆33 · Mar 4, 2025 · Updated last year
- ☆44 · May 2, 2024 · Updated last year
- Tabular In-Context Learning ☆113 · Mar 6, 2025 · Updated last year
- Foundation Model for Tabular Data via reticulate ☆26 · Mar 18, 2026 · Updated 3 weeks ago
- ⚡ Easy API access to the tabular foundation model TabPFN ⚡ ☆232 · Updated this week
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks. ☆23 · Mar 31, 2025 · Updated last year
- Official implementation of "TabEBM: A Tabular Data Augmentation Method with Class-Specific Energy-Based Models", NeurIPS 2024 ☆25 · Aug 19, 2025 · Updated 7 months ago
- TabPFGen: Synthetic Tabular Data Generation with TabPFN ☆39 · Jul 15, 2025 · Updated 8 months ago
- ☆15 · May 26, 2022 · Updated 3 years ago
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search. ☆15 · Mar 22, 2023 · Updated 3 years ago
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops) ☆379 · Updated this week
- a minimal website to get the diff of llm rewrites ☆11 · Dec 11, 2024 · Updated last year
- The PyExperimenter is a tool for the automatic execution of experiments, e.g. for machine learning (ML), capturing corresponding results … ☆39 · Mar 18, 2026 · Updated 3 weeks ago
- Amortized Inference for Causal Structure Learning, NeurIPS 2022 ☆73 · Feb 11, 2025 · Updated last year
- Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?" ☆179 · Mar 22, 2024 · Updated 2 years ago
- Interpretable ML for TabPFN ☆51 · Jul 13, 2025 · Updated 8 months ago
- ☆19 · Jun 3, 2023 · Updated 2 years ago
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡ ☆6,041 · Updated this week
- This work introduces LaT-PFN, a novel time series model that combines PFN and JEPA frameworks to generate zero-shot forecasts efficientl… ☆20 · Aug 1, 2024 · Updated last year
- This is the official repo for the paper "LLM-FE" ☆65 · Mar 5, 2026 · Updated last month
- Our maintained PFN repository. Come here to train SOTA PFNs. ☆136 · Jan 21, 2026 · Updated 2 months ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆111 · Aug 17, 2025 · Updated 7 months ago
- ☆16 · Nov 25, 2022 · Updated 3 years ago
- ☆32 · Jan 28, 2025 · Updated last year
- A learning curve benchmark on OpenML data ☆34 · Nov 29, 2024 · Updated last year
- A rule-based approach to explain the output of any machine learning model ☆15 · Apr 4, 2024 · Updated 2 years ago
- ☆337 · Jun 19, 2024 · Updated last year
- ☆45 · Aug 2, 2024 · Updated last year
- OpenFE: automated feature generation with expert-level performance ☆872 · May 27, 2024 · Updated last year
- ☆19 · Feb 28, 2025 · Updated last year
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24 ☆233 · Dec 3, 2024 · Updated last year
- ☆22 · Oct 30, 2024 · Updated last year
- In-context Bayesian Optimization ☆17 · Feb 20, 2026 · Updated last month
- A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks. ☆19 · Mar 29, 2026 · Updated 2 weeks ago
- Computing the gap statistics from Tibshirani et al. for various clustering algorithms ☆13 · Nov 10, 2025 · Updated 5 months ago
- LLM-powered Q/A over arXiv preprints ☆32 · Apr 5, 2023 · Updated 3 years ago
- [NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benc… ☆20 · Nov 12, 2023 · Updated 2 years ago
- A Living Benchmark for Machine Learning on Tabular Data ☆207 · Apr 4, 2026 · Updated last week
- sktime - python toolbox for time series: pipelines and transformers ☆26 · Dec 1, 2022 · Updated 3 years ago