A novel approach for synthesizing tabular data using pretrained large language models
☆349Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for be_great
Users that are interested in be_great are comparing it to the libraries listed below
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆243Jan 4, 2026Updated 2 months ago
- ☆68May 23, 2023Updated 2 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆640Feb 11, 2026Updated 3 weeks ago
- ☆27Mar 9, 2023Updated 2 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆256Feb 26, 2026Updated last week
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆532Jul 13, 2024Updated last year
- Benchmarking synthetic data generation methods.☆300Updated this week
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"☆95Jan 19, 2024Updated 2 years ago
- PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)☆15Oct 10, 2022Updated 3 years ago
- The code for the paper "MediTab: Scaling Medical Tabular Data Predictors via Data Consolidation, Enrichment, and Refinement"☆23May 8, 2024Updated last year
- Evaluate real and synthetic datasets against each other☆92Jul 28, 2025Updated 7 months ago
- Experiments on Tabular Data Models☆280May 25, 2023Updated 2 years ago
- A repo for transfer learning with deep tabular models☆104Feb 15, 2023Updated 3 years ago
- Conditional GAN for generating synthetic tabular data.☆1,525Feb 22, 2026Updated last week
- Official code for "CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis", ICML 2023☆37Jan 9, 2024Updated 2 years ago
- Code and data for "TURL: Table Understanding through Representation Learning"☆135Nov 23, 2025Updated 3 months ago
- ☆337Nov 2, 2023Updated 2 years ago
- tableGAN is a synthetic data generation technique (Data Synthesis based on Generative Adversarial Networks paper) based on Generative Ad…☆153May 1, 2019Updated 6 years ago
- The TABLET benchmark for evaluating instruction learning with LLMs for tabular prediction.☆25Apr 28, 2023Updated 2 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- A library of Reversible Data Transforms☆131Feb 23, 2026Updated last week
- Implementation of the paper: "FedTabDiff: Federated Learning of Diffusion Models for Synthetic Mixed-Type Tabular Data Generation"☆21Nov 10, 2024Updated last year
- Generating Tabular Synthetic Data using State of the Art GAN architecture☆82Apr 29, 2020Updated 5 years ago
- Synthetic data generation for tabular data☆3,434Updated this week
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆564Jun 24, 2025Updated 8 months ago
- ⚡ TabPFN: Foundation Model for Tabular Data ⚡☆5,766Updated this week
- NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables☆214Mar 13, 2025Updated 11 months ago
- A modular Python framework for standardized evaluation and benchmarking of online learning models.☆10Nov 24, 2022Updated 3 years ago
- A code for the NeurIPS 2022 Table Representation Learning Workshop paper: "Diffusion models for missing value imputation in tabular data"☆57Jun 20, 2024Updated last year
- ☆24Sep 16, 2022Updated 3 years ago
- Individual Coefficient Approximation for Risk Estimation (ICARE) model☆18Sep 9, 2023Updated 2 years ago
- Code for Transformed Distribution Matching (TDM) for Missing Value Imputation, ICML 2023☆14Aug 4, 2023Updated 2 years ago
- Resources for PVLDB 2023 submission☆25Aug 28, 2024Updated last year
- Tabular In-Context Learning☆109Mar 6, 2025Updated 11 months ago
- [NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets☆88Feb 28, 2023Updated 3 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆672Jun 24, 2025Updated 8 months ago
- ☆15Dec 15, 2015Updated 10 years ago
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Oct 14, 2024Updated last year
- Generating and Imputing Tabular Data via Diffusion and Flow XGBoost Models☆181Aug 6, 2024Updated last year