noahho / CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆155 Updated 4 months ago
Alternatives and similar repositories for CAAFE:
Users that are interested in CAAFE are comparing it to the libraries listed below
- ☆31 Updated 11 months ago
- ☆300 Updated last year
- Compare and ensemble models without retraining ☆54 Updated this week
- ☆150 Updated last year
- Tabular In-Context Learning ☆58 Updated last month
- Experimental library integrating LLM capabilities to support causal analyses ☆128 Updated this week
- Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗 ☆105 Updated 3 weeks ago
- ☆64 Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆125 Updated 5 months ago
- ☆52 Updated this week
- A collection of AWESOME language modeling techniques on tabular data applications. ☆30 Updated 6 months ago
- A novel approach for synthesizing tabular data using pretrained large language models ☆310 Updated 5 months ago
- (ICLR 2024) GRANDE: Gradient-Based Decision Tree Ensembles ☆90 Updated last month
- ML Assistant for Competitive Machine Learning ☆115 Updated 2 months ago
- ☆287 Updated 10 months ago
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey". ☆137 Updated 6 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD) ☆186 Updated 2 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts? ☆50 Updated 2 months ago
- Context is Key: A Benchmark for Forecasting with Essential Textual Information ☆60 Updated 2 months ago
- A Natural Language Interface to Explainable Boosting Machines ☆66 Updated 9 months ago
- Revisiting Pretraining Objectives for Tabular Deep Learning ☆63 Updated 2 years ago
- The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction. ☆21 Updated last year
- ☆70 Updated last month
- ☆38 Updated 2 years ago
- ML models + benchmark for tabular data classification and regression ☆119 Updated 3 weeks ago
- TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks ☆69 Updated 3 weeks ago
- The pioneering neural network surpassing extremely-tuned XGBoost and CatBoost on varied tabular datasets. ☆61 Updated 10 months ago
- [ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation ☆66 Updated last week
- A benchmark list for evaluation of large language models. ☆99 Updated last month
- ☆39 Updated 2 months ago