π A curated list of resources dedicated to synthetic data
β141Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and pheβ¦β33Jan 19, 2022Updated 4 years ago
- A curated list of awesome synthetic data tools (open source and commercial).β245Jan 11, 2024Updated 2 years ago
- Public blueprints for data use casesβ84Sep 10, 2025Updated 6 months ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β675Jun 24, 2025Updated 9 months ago
- A curated list of awesome resources for creating synthetic dataβ45Feb 16, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- β41Sep 6, 2023Updated 2 years ago
- Java interface to tauargusβ14Sep 26, 2025Updated 6 months ago
- β43Dec 7, 2022Updated 3 years ago
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPRβ99Jan 9, 2026Updated 2 months ago
- Python toolbox for multi-omics data mapping and analysisβ26Apr 13, 2023Updated 2 years ago
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.β23Jun 12, 2025Updated 9 months ago
- Vector Approximate Message Passing inference framework for GWASβ19Jan 14, 2026Updated 2 months ago
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metricsβ46Apr 4, 2023Updated 2 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.β645Feb 11, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!β27Jul 13, 2023Updated 2 years ago
- Tools and service for differentially private processing of tabular and relational dataβ296Mar 7, 2026Updated 3 weeks ago
- β51May 23, 2023Updated 2 years ago
- Implementation of Quantum Ising Born Machineβ19May 1, 2020Updated 5 years ago
- An R-Package to Visualize the (Causal) Effect of a Continuous Variable on a Time-To-Event Outcomeβ16Jan 29, 2026Updated 2 months ago
- Semantic data model of the set of common data elements for rare disease registrationβ12Oct 26, 2023Updated 2 years ago
- Synthetic data generation for tabular dataβ3,451Updated this week
- β17Jul 15, 2024Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets.β257Mar 23, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Pipeline for genetic epidemiology projects at Univsersity of Bristolβ15Dec 18, 2025Updated 3 months ago
- β16Jun 17, 2024Updated last year
- Code for GBMI trans-ancestry proteome Mendelian randomization satellite paperβ18Dec 25, 2023Updated 2 years ago
- Dual Adversarial Autoencoder for Generating Set-valued Sequencesβ19Jan 15, 2021Updated 5 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.β25Mar 6, 2025Updated last year
- This is the implementation of the Recursive Nearest (Neighbor) Agglomerationβ11Oct 9, 2020Updated 5 years ago
- A comprehensive evaluation framework for the SEA regionβ19Mar 4, 2026Updated 3 weeks ago
- β11May 17, 2021Updated 4 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)β18Dec 5, 2024Updated last year
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SPARQL visual query builder and RDF explorerβ19Jan 16, 2020Updated 6 years ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ69Feb 22, 2023Updated 3 years ago
- Visualization of many Clustering Algorithms, via Notebook or GUIβ24Apr 6, 2021Updated 4 years ago
- β21Aug 20, 2024Updated last year
- PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Developmentβ124Jan 8, 2024Updated 2 years ago
- THIS IS THE OLD REPO: Use this one instead: https://github.com/monarch-initiative/mondo-buildβ17Feb 5, 2021Updated 5 years ago
- β19Apr 15, 2022Updated 3 years ago