tabularis-ai / be_great
A novel approach for synthesizing tabular data using pretrained large language models
☆310Updated this week
Alternatives and similar repositories for be_great
Users that are interested in be_great are comparing it to the libraries listed below
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆226Updated 2 months ago
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆291Updated 6 months ago
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆85Updated last year
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆447Updated 10 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆233Updated 3 weeks ago
- Experiments on Tabular Data Models☆277Updated last year
- Benchmarking synthetic data generation methods.☆273Updated last week
- Official GitHub for CTAB-GAN+☆75Updated 11 months ago
- ☆154Updated last year
- Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""☆143Updated 9 months ago
- Official git for "TabuLa: Harnessing Language Models for Tabular Data Synthesis"☆39Updated last week
- A repo for transfer learning with deep tabular models☆102Updated 2 years ago
- ☆302Updated last year
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆189Updated 2 months ago
- ☆478Updated 8 months ago
- Tabular In-Context Learning☆61Updated 2 months ago
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Updated 2 years ago
- A Natural Language Interface to Explainable Boosting Machines☆66Updated 10 months ago
- Generating and Imputing Tabular Data via Diffusion and Flow XGBoost Models☆153Updated 9 months ago
- ☆64Updated last year
- Compare and ensemble models without retraining☆55Updated this week
- Tabular Deep Learning Library for PyTorch☆653Updated last week
- A framework for prototyping and benchmarking imputation methods☆183Updated 2 years ago
- Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data☆46Updated last week
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆547Updated this week
- ☆27Updated 2 years ago
- ☆199Updated this week
- Interpret text data using LLMs (scikit-learn compatible).☆163Updated last month
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆158Updated 4 months ago
- Revisiting Pretrarining Objectives for Tabular Deep Learning☆63Updated 2 years ago