kathrinse / be_great
A novel approach for synthesizing tabular data using pretrained large language models
☆296Updated 3 months ago
Alternatives and similar repositories for be_great:
Users that are interested in be_great are comparing it to the libraries listed below
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆221Updated 2 months ago
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆420Updated 7 months ago
- ☆469Updated 6 months ago
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆82Updated last year
- Experiments on Tabular Data Models☆272Updated last year
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆282Updated 3 months ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆516Updated last month
- Benchmarking synthetic data generation methods.☆267Updated this week
- Official GitHub for CTAB-GAN+☆73Updated 9 months ago
- A framework for prototyping and benchmarking imputation methods☆175Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets.☆222Updated this week
- Official git for "TabuLa: Harnessing Language Models for Tabular Data Synthesis"☆37Updated 3 months ago
- Compare and ensemble models without retraining☆46Updated this week
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆181Updated this week
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Updated 2 years ago
- ☆142Updated 10 months ago
- A repo for transfer learning with deep tabular models☆102Updated 2 years ago
- Shapley Interactions and Shapley Values for Machine Learning☆322Updated this week
- For calculating global feature importance using Shapley values.☆264Updated last week
- Evaluate real and synthetic datasets against each other☆86Updated last month
- The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive …☆414Updated 3 years ago
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆149Updated 2 months ago
- Python package for conformal prediction☆477Updated 4 months ago
- ☆64Updated last year
- A Natural Language Interface to Explainable Boosting Machines☆64Updated 7 months ago
- Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""☆122Updated 7 months ago
- (NeurIPS 2022) On Embeddings for Numerical Features in Tabular Deep Learning☆335Updated 2 months ago
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆540Updated last month
- A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.☆219Updated last month
- Revisiting Pretrarining Objectives for Tabular Deep Learning☆63Updated 2 years ago