Benchmarking synthetic data generation methods.
☆304Mar 16, 2026Updated this week
Alternatives and similar repositories for SDGym
Users that are interested in SDGym are comparing it to the libraries listed below
Sorting:
- A library of Reversible Data Transforms☆133Feb 23, 2026Updated 3 weeks ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆256Updated this week
- Conditional GAN for generating synthetic tabular data.☆1,532Updated this week
- Generative adversarial training for generating synthetic tabular data.☆296Nov 26, 2022Updated 3 years ago
- Synthetic Data Generation for mixed-type, multivariate time series.☆120Feb 23, 2026Updated 3 weeks ago
- Data Lineage Tracing Library☆24Nov 30, 2021Updated 4 years ago
- A library to model multivariate data using copulas.☆635Updated this week
- tableGAN is a synthetic data generation technique (Data Synthesis based on Generative Adversarial Networks paper) based on Generative Ad…☆154May 1, 2019Updated 6 years ago
- We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review …☆565Mar 7, 2026Updated 2 weeks ago
- Evaluate real and synthetic datasets against each other☆92Jul 28, 2025Updated 7 months ago
- A novel approach for synthesizing tabular data using pretrained large language models☆350Feb 9, 2026Updated last month
- Official GitHub for CTAB-GAN+☆85May 14, 2024Updated last year
- Official code for "STaSy: Score-based Tabular data Synthesis", ICLR 2023☆35Aug 11, 2023Updated 2 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆673Jun 24, 2025Updated 8 months ago
- ☆14Dec 23, 2020Updated 5 years ago
- ☆43Dec 7, 2022Updated 3 years ago
- A curated list of awesome resources for creating synthetic data☆45Feb 16, 2022Updated 4 years ago
- ☆275Apr 3, 2024Updated last year
- Official code for "CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis", ICML 2023☆37Jan 9, 2024Updated 2 years ago
- A toolbox for differentially private data generation☆130Jul 6, 2023Updated 2 years ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆244Jan 4, 2026Updated 2 months ago
- Generative adversarial network for generating electronic health records.☆283Aug 19, 2019Updated 6 years ago
- SAP Security research sample code and tutorials for generating differentially private synthetic datasets using generative deep learning m…☆22Mar 7, 2024Updated 2 years ago
- COR-GAN: Correlation-Capturing Convolutional Neural Networks for Generating Synthetic Healthcare Records☆56Dec 15, 2020Updated 5 years ago
- Tools and service for differentially private processing of tabular and relational data☆293Mar 7, 2026Updated 2 weeks ago
- Differentially Private (tabular) Generative Models Papers with Code☆54Jul 2, 2024Updated last year
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.☆643Feb 11, 2026Updated last month
- Synthetic data generators for tabular and time-series data☆1,615Mar 2, 2026Updated 2 weeks ago
- Source code of paper "Differentially Private Generative Adversarial Network"☆71Nov 29, 2018Updated 7 years ago
- Differentially-private Wasserstein GAN implementation in PyTorch☆28Nov 1, 2019Updated 6 years ago
- [ICML 2023] The official implementation of the paper "TabDDPM: Modelling Tabular Data with Diffusion Models"☆533Jul 13, 2024Updated last year
- An implementation of the differentially private variational inference algorithm for NumPyro.☆16Sep 3, 2024Updated last year
- SDNist: Benchmark data and evaluation tools for data synthesizers.☆39Jul 16, 2025Updated 8 months ago
- A simple, extensible library for developing AutoML systems☆175Jul 28, 2023Updated 2 years ago
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆81Apr 14, 2022Updated 3 years ago
- Differentially Private Synthetic Data Generation [DP-SDG] - Experimental Setups & Knowledge Base - WORK IN PROGRESS☆12Jul 26, 2022Updated 3 years ago
- State space and deep generative models for time series.☆54Jul 31, 2023Updated 2 years ago
- Standardised Metrics and Methods for Synthetic Tabular Data Evaluation☆36Aug 14, 2024Updated last year
- [IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions☆311Nov 3, 2023Updated 2 years ago