kathrinse / be_great
A novel approach for synthesizing tabular data using pretrained large language models
☆303Updated 4 months ago
Alternatives and similar repositories for be_great:
Users that are interested in be_great are comparing it to the libraries listed below
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆223Updated last week
- Benchmarking synthetic data generation methods.☆270Updated this week
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆83Updated last year
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆431Updated 8 months ago
- Experiments on Tabular Data Models☆274Updated last year
- Official GitHub for CTAB-GAN+☆73Updated 10 months ago
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆287Updated 4 months ago
- Official git for "TabuLa: Harnessing Language Models for Tabular Data Synthesis"☆39Updated last month
- Metrics to evaluate quality and efficacy of synthetic datasets.☆227Updated this week
- A framework for prototyping and benchmarking imputation methods☆177Updated last year
- ☆146Updated last year
- ☆473Updated 7 months ago
- ☆64Updated last year
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆523Updated 2 months ago
- Compare and ensemble models without retraining☆49Updated this week
- Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""☆130Updated 8 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets☆65Updated 2 years ago
- Evaluate real and synthetic datasets against each other☆86Updated 2 months ago
- [ICLR 2025] TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation☆52Updated this week
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆183Updated last month
- Revisiting Pretrarining Objectives for Tabular Deep Learning☆63Updated 2 years ago
- A Natural Language Interface to Explainable Boosting Machines☆65Updated 8 months ago
- A library of Reversible Data Transforms☆124Updated this week
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Updated 2 years ago
- Editing machine learning models to reflect human knowledge and values☆124Updated last year
- For calculating global feature importance using Shapley values.☆266Updated this week
- Generating and Imputing Tabular Data via Diffusion and Flow XGBoost Models☆148Updated 7 months ago
- Tabular Deep Learning Library for PyTorch☆636Updated this week
- ☆26Updated 2 years ago
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆545Updated last week