A curated list of awesome synthetic data tools (open source and commercial).
β245Jan 11, 2024Updated 2 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A curated list of resources dedicated to synthetic dataβ141Jul 29, 2022Updated 3 years ago
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPRβ99Jan 9, 2026Updated 2 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β24Jun 22, 2022Updated 3 years ago
- Synthetic data generation for tabular dataβ3,451Updated this week
- Standardised Metrics and Methods for Synthetic Tabular Data Evaluationβ36Aug 14, 2024Updated last year
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set.β56Sep 3, 2024Updated last year
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β675Jun 24, 2025Updated 9 months ago
- Build datasets using natural languageβ571Sep 19, 2025Updated 6 months ago
- Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANsβ45Jun 21, 2023Updated 2 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.β645Feb 11, 2026Updated last month
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Expressβ13Oct 16, 2024Updated last year
- GWAS Summary Statistics for Brain Imaging Phenotypesβ20Sep 6, 2021Updated 4 years ago
- β275Apr 3, 2024Updated last year
- How to write integration tests for data pipelines using Great Expectations and pytest.β15Dec 12, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and pheβ¦β33Jan 19, 2022Updated 4 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.β14Dec 15, 2024Updated last year
- Synthetic Data Generation for mixed-type, multivariate time series.β121Feb 23, 2026Updated last month
- β20Jan 10, 2024Updated 2 years ago
- Source code for the Observatory of Anonymityβ10Dec 5, 2022Updated 3 years ago
- Synthetic Data SDK β¨β753Jan 13, 2026Updated 2 months ago
- Simple template for running snakemake with Rβ12Jan 31, 2023Updated 3 years ago
- Algorithms for generating synthetic dataβ16Jun 18, 2024Updated last year
- Swift package that houses commonly used functions, extensions, views, classes, etc.β13Oct 25, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β15Apr 29, 2025Updated 10 months ago
- A Shared Nearest Neighbors clustering implementation. This code is basically a wrapper of sklearn DBSCAN, implementing the neighborhood sβ¦β16Jan 10, 2022Updated 4 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-cβ¦β15Oct 2, 2023Updated 2 years ago
- A simple example of VAEs with KANsβ12May 17, 2024Updated last year
- Official Implementation of Knowledge Flow Promptingβ35Oct 20, 2025Updated 5 months ago
- In this article, I will present an open-source AI tool for writing grant applications, using Microsoft AutoGen combined with Retrieval-Auβ¦β23Jul 19, 2025Updated 8 months ago
- Vector Approximate Message Passing inference framework for GWASβ19Jan 14, 2026Updated 2 months ago
- Web application that makes data releases that satisfy differential privacy using the OpenDP Libraryβ22Aug 2, 2024Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ216Sep 18, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pair-wise conditional analysis and colocalisationβ42May 2, 2024Updated last year
- β13Apr 25, 2025Updated 11 months ago
- A curated list of materials on AI guardrailsβ48Jun 3, 2025Updated 9 months ago
- Multivariate Electricity Consumption Prediction with Extreme Learning Machineβ10Jun 25, 2018Updated 7 years ago
- Explains Canadian Billsβ17May 13, 2023Updated 2 years ago
- β40Mar 20, 2025Updated last year
- A collection of tutorials, demos, and use cases for IBM Data Science Experience http://datascience.ibm.com/β14Nov 20, 2017Updated 8 years ago