noahho / CAAFELinks
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆163Updated 6 months ago
Alternatives and similar repositories for CAAFE
Users that are interested in CAAFE are comparing it to the libraries listed below
Sorting:
- Tabular In-Context Learning☆74Updated 3 months ago
- ☆35Updated last year
- ☆310Updated last year
- Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"☆162Updated last year
- A Living Benchmark for Machine Learning on Tabular Data☆86Updated this week
- Interpret text data using LLMs (scikit-learn compatible).☆166Updated 2 weeks ago
- ☆66Updated 2 years ago
- Multi-Agent System for End-to-end Multimodal ML Automation☆135Updated last week
- ☆53Updated 2 months ago
- Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24☆188Updated 6 months ago
- Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗☆142Updated this week
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".☆159Updated 8 months ago
- Awesome Tabular Deep Learning for "Representation Learning for Tabular Data: A Comprehensive Survey"☆34Updated last month
- Understanding Different Design Choices in Training Large Time Series Models☆95Updated 2 months ago
- The pioneering neural network surpassing extremely-tuned XGboost and Catboost on varied tabular datasets.☆62Updated last year
- Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data☆97Updated last week
- Zero-shot Time Series Forecasting with TabPFN (work accepted at NeurIPS 2024 TRL and TSALM workshops)☆215Updated this week
- ☆294Updated last year
- ☆70Updated 3 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆55Updated 4 months ago
- ☆226Updated 6 months ago
- A novel approach for synthesizing tabular data using pretrained large language models☆310Updated last month
- A collection of AWESOME language modeling techniques on tabular data applications.☆31Updated 8 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆194Updated 4 months ago
- (ICLR 2025 Spotlight) TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks☆75Updated 3 weeks ago
- Code for finetuning TabPFN on one downstream tabular dataset.☆61Updated last month
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆295Updated 7 months ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆398Updated 6 months ago
- ☆38Updated 2 years ago
- Experimental library integrating LLM capabilities to support causal analyses☆216Updated 2 weeks ago