noahho / CAAFELinks
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆180Updated last year
Alternatives and similar repositories for CAAFE
Users that are interested in CAAFE are comparing it to the libraries listed below
Sorting:
- Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation☆253Updated last week
- ☆335Updated 2 years ago
- Tabular In-Context Learning☆107Updated 11 months ago
- ☆42Updated last year
- A novel approach for synthesizing tabular data using pretrained large language models☆344Updated 2 months ago
- Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"☆177Updated last year
- Interpret text data with LLMs (sklearn compatible).☆176Updated 2 weeks ago
- Experimental library integrating LLM capabilities to support causal analyses☆287Updated last month
- Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data☆313Updated 9 months ago
- A Living Benchmark for Machine Learning on Tabular Data☆184Updated last week
- ☆68Updated 2 years ago
- Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data☆315Updated 2 months ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆422Updated last year
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops)☆345Updated 2 weeks ago
- LTSM-Bundle: A Toolbox and Benchmark on Large Language Models for Time Series Forecasting☆105Updated 5 months ago
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".☆187Updated last year
- Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗☆249Updated last week
- Awesome Tabular Deep Learning for "Representation Learning for Tabular Data: A Comprehensive Survey"☆96Updated 3 weeks ago
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24☆225Updated last year
- Context is Key: A Benchmark for Forecasting with Essential Textual Information☆86Updated 6 months ago
- ☆66Updated 2 weeks ago
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆68Updated last year
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆210Updated 3 months ago
- ☆328Updated last year
- ☆503Updated last year
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.☆23Updated 10 months ago
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆160Updated 3 months ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆243Updated last month
- ☆16Updated 3 years ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Updated last year