Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆192Dec 20, 2024Updated last year
Alternatives and similar repositories for CAAFE
Users that are interested in CAAFE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆46May 2, 2024Updated 2 years ago
- Tabular In-Context Learning☆115Mar 6, 2025Updated last year
- Foundation Model for Tabular Data via reticulate☆33May 26, 2026Updated 2 weeks ago
- Neural Pipeline Search (NePS): Helps deep learning experts find the best neural pipeline.☆79Updated this week
- Official implementation of "TabEBM: A Tabular Data Augmentation Method with Class-Specific Energy-Based Models", NeurIPS 2024☆25Aug 19, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15May 26, 2022Updated 4 years ago
- TabPFGen: Synthetic Tabular Data Generation with TabPFN☆42Jul 15, 2025Updated 10 months ago
- Ensemble-based, size-agnostic wrapper for the TabPFN classifier☆35May 18, 2024Updated 2 years ago
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Mar 22, 2023Updated 3 years ago
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops)☆414Jun 4, 2026Updated last week
- a minimal website to get the diff of llm rewrites☆11Dec 11, 2024Updated last year
- The PyExperimenter is a tool for the automatic execution of experiments, e.g. for machine learning (ML), capturing corresponding results …☆39Mar 18, 2026Updated 2 months ago
- Amortized Inference for Causal Structure Learning, NeurIPS 2022☆78Feb 11, 2025Updated last year
- Interpretable ML for TabPFN☆53Jul 13, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This work introduces LaT-PFN, a novel time series model that combines PFN and JEPA frameworks to generate zero-shot forecasts efficientl…☆22Aug 1, 2024Updated last year
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡☆7,295Updated this week
- Our maintained PFN repository. Come here to train SOTA PFNs.☆146Jan 21, 2026Updated 4 months ago
- A learning curve benchmark on OpenML data☆34Nov 29, 2024Updated last year
- ☆16Nov 25, 2022Updated 3 years ago
- A rule-based aproach to explain the output of any machine learning model☆17Apr 4, 2024Updated 2 years ago
- ☆342Jun 19, 2024Updated last year
- TabICLv2: A state-of-the-art tabular foundation model☆936Jun 5, 2026Updated last week
- ☆46Aug 2, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- OpenFE: automated feature generation with expert-level performance☆873May 27, 2024Updated 2 years ago
- ☆19Feb 28, 2025Updated last year
- ☆23Oct 30, 2024Updated last year
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models"☆48Dec 29, 2023Updated 2 years ago
- Awesome list of AutoML frameworks - curated by @oskar-j☆32Feb 15, 2023Updated 3 years ago
- [NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benc…☆20Nov 12, 2023Updated 2 years ago
- sktime - python toolbox for time series: pipelines and transformers☆26Dec 1, 2022Updated 3 years ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆35Oct 25, 2024Updated last year
- A Living Benchmark for Machine Learning on Tabular Data☆240Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation☆286Mar 20, 2026Updated 2 months ago
- A benchmark of meaningful graph datasets with tabular node features☆16Oct 29, 2025Updated 7 months ago
- The official implementation of PFNs4BO: In-Context Learning for Bayesian Optimization☆43Sep 18, 2025Updated 8 months ago
- ☆91Jan 27, 2026Updated 4 months ago
- Performant, composable online learning☆16Feb 22, 2021Updated 5 years ago
- Code release for DeepEDM (ICML 2025)☆29Jan 20, 2026Updated 4 months ago
- ☆15Apr 18, 2024Updated 2 years ago