kathrinse / be_great
A novel approach for synthesizing tabular data using pretrained large language models
☆281Updated last week
Related projects ⓘ
Alternatives and complementary repositories for be_great
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆212Updated 3 weeks ago
- Experiments on Tabular Data Models☆268Updated last year
- The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"☆267Updated 8 months ago
- ☆453Updated 2 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆211Updated this week
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆394Updated 3 months ago
- ☆115Updated 7 months ago
- A framework for prototyping and benchmarking imputation methods☆164Updated last year
- Benchmarking synthetic data generation methods.☆262Updated this week
- WeightedSHAP: analyzing and improving Shapley based feature attributions (NeurIPS 2022)☆160Updated 2 years ago
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆80Updated 9 months ago
- A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)☆158Updated last month
- A Natural Language Interface to Explainable Boosting Machines☆60Updated 4 months ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆451Updated last month
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆130Updated 9 months ago
- The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive …☆401Updated 2 years ago
- Official GitHub for CTAB-GAN+☆69Updated 5 months ago
- pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation☆104Updated last month
- A repo for transfer learning with deep tabular models☆101Updated last year
- Frouros: an open-source Python library for drift detection in machine learning systems.☆192Updated 3 weeks ago
- Implementation of TabTransformer, attention network for tabular data, in Pytorch☆806Updated 11 months ago
- ☆59Updated last year
- TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks☆50Updated 2 weeks ago
- Evaluate real and synthetic datasets against each other☆80Updated 3 weeks ago
- Train Gradient Boosting models that are both high-performance *and* Fair!☆103Updated 4 months ago
- (NeurIPS 2021) Revisiting Deep Learning Models for Tabular Data☆213Updated 5 months ago
- Editing machine learning models to reflect human knowledge and values☆123Updated last year
- Official git for "TabuLa: Harnessing Language Models for Tabular Data Synthesis"☆29Updated last week
- ☆24Updated last year
- Tabular Deep Learning Library for PyTorch☆537Updated 3 weeks ago