π A curated list of resources dedicated to synthetic data
β141Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated list of awesome synthetic data tools (open source and commercial).β250Jan 11, 2024Updated 2 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β675Jun 24, 2025Updated 9 months ago
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.β52Apr 8, 2026Updated last week
- β43Dec 7, 2022Updated 3 years ago
- β32Mar 21, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python toolbox for multi-omics data mapping and analysisβ26Apr 13, 2023Updated 3 years ago
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPRβ100Apr 8, 2026Updated last week
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.β23Jun 12, 2025Updated 10 months ago
- Supplemental Material for the ESANN 2019 Submission "Preserving privacy using synthetic data models and applications in health informaticβ¦β20Mar 20, 2020Updated 6 years ago
- β26Mar 9, 2023Updated 3 years ago
- Chat with Time-Series Data in PostgreSQL using LlamaIndex and Timescale Vectorβ12Mar 24, 2024Updated 2 years ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"β44Jul 19, 2024Updated last year
- Vector Approximate Message Passing inference framework for GWASβ19Jan 14, 2026Updated 3 months ago
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metricsβ46Apr 4, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LMQL implementation of tree of thoughtsβ36Jan 31, 2024Updated 2 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.β651Feb 11, 2026Updated 2 months ago
- β23Oct 7, 2025Updated 6 months ago
- Curated LLM (ICML 2024)β14Oct 23, 2024Updated last year
- Tools and service for differentially private processing of tabular and relational dataβ295Updated this week
- β51May 23, 2023Updated 2 years ago
- E-Syn: E-Graph Rewriting with Technology-Aware Cost Functions for Logic Synthesis (DAC 2024)β42Jul 17, 2024Updated last year
- Semantic data model of the set of common data elements for rare disease registrationβ12Oct 26, 2023Updated 2 years ago
- Synthetic data generation for tabular dataβ3,467Updated this week
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Data, codebook, and models to automatically detect storytelling.β29Apr 23, 2025Updated 11 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets.β258Updated this week
- β16Jun 17, 2024Updated last year
- Dual Adversarial Autoencoder for Generating Set-valued Sequencesβ19Jan 15, 2021Updated 5 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.β25Mar 6, 2025Updated last year
- This is the implementation of the Recursive Nearest (Neighbor) Agglomerationβ11Oct 9, 2020Updated 5 years ago
- β18Jul 15, 2024Updated last year
- An implementation of the FuzzyDBSCAN algorithm.β13Nov 6, 2022Updated 3 years ago
- SPARQL visual query builder and RDF explorerβ19Jan 16, 2020Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [Robust cardiac MR image segmentation foundation model] This code contains the most powerful cardiac segmentation model trained from UK bβ¦β14Jun 13, 2022Updated 3 years ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ69Feb 22, 2023Updated 3 years ago
- This project is a FAIRness assessment tool for ontologies, vocabularies and semantic resources.β23Mar 12, 2025Updated last year
- THIS IS THE OLD REPO: Use this one instead: https://github.com/monarch-initiative/mondo-buildβ17Feb 5, 2021Updated 5 years ago
- Signal processing on graphs using torch_geometric.β14Jul 11, 2022Updated 3 years ago
- Framework for image and flow cytometry analysisβ33Mar 17, 2026Updated last month
- Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANsβ45Jun 21, 2023Updated 2 years ago