automl / CAAFE
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
☆121Updated 7 months ago
Related projects: ⓘ
- Salesforce CausalAI Library: A Fast and Scalable framework for Causal Analysis of Time Series and Tabular Data☆255Updated 10 months ago
- A novel approach for synthesizing tabular data using pretrained large language models☆271Updated 3 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆120Updated 5 months ago
- ☆57Updated last year
- ☆266Updated 10 months ago
- Experimental library integrating LLM capabilities to support causal analyses☆70Updated last week
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆273Updated last month
- ☆109Updated 5 months ago
- Scikit-learn friendly library to interpret, and prompt-engineer text datasets using large language models.☆151Updated 2 months ago
- Revisiting Pretrarining Objectives for Tabular Deep Learning☆60Updated 2 years ago
- Mothernet: A Foundational Hypernetwork for Tabular Classification☆26Updated this week
- A Natural Language Interface to Explainable Boosting Machines☆59Updated 2 months ago
- ☆60Updated 6 months ago
- Code for paper: Are Large Language Models Post Hoc Explainers?☆21Updated last month
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆256Updated 6 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆79Updated 5 months ago
- ☆30Updated last year
- ☆33Updated this week
- Extending Conformal Prediction to LLMs☆53Updated 2 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆149Updated this week
- RelBench: Relational Deep Learning Benchmark☆183Updated 2 weeks ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆203Updated last month
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆87Updated 3 months ago
- Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".☆47Updated last month
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆161Updated last year
- ☆14Updated 4 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆26Updated last week
- (ICLR 2024) GRANDE: Gradient-Based Decision Tree Ensembles☆84Updated 2 weeks ago
- A Python toolbox for conformal prediction research on deep learning models, using PyTorch.☆222Updated last week
- Tabular Deep Learning Library for PyTorch☆498Updated this week