π A curated list of resources dedicated to synthetic data
β142Jul 29, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.β28Mar 5, 2025Updated last year
- Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and pheβ¦β33Jan 19, 2022Updated 4 years ago
- Generative models to automatically anonymize data to meet GDPR & CCPA standards.β30Jan 24, 2023Updated 3 years ago
- Public blueprints for data use casesβ85Sep 10, 2025Updated 8 months ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β678Jun 24, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A curated list of awesome resources for creating synthetic dataβ45Feb 16, 2022Updated 4 years ago
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.β52May 19, 2026Updated last week
- β44Dec 7, 2022Updated 3 years ago
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPRβ101Apr 8, 2026Updated last month
- β26Mar 9, 2023Updated 3 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.β663Apr 21, 2026Updated last month
- Curated LLM (ICML 2024)β14Oct 23, 2024Updated last year
- Multivariate Electricity Consumption Prediction with Extreme Learning Machineβ10Jun 25, 2018Updated 7 years ago
- Tools and service for differentially private processing of tabular and relational dataβ296May 1, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of Quantum Ising Born Machineβ19May 1, 2020Updated 6 years ago
- β52May 23, 2023Updated 3 years ago
- Synthetic data generation for tabular dataβ3,497Updated this week
- β13Aug 6, 2024Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets.β259May 18, 2026Updated last week
- This work combines differential privacy and multi-party computation protocol to achieve distributed machine learning.β27Oct 15, 2020Updated 5 years ago
- Dual Adversarial Autoencoder for Generating Set-valued Sequencesβ19Jan 15, 2021Updated 5 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points.β26Mar 6, 2025Updated last year
- β17Jul 15, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Fun project to run your own LLM chat bot using llama.cppβ11Jun 9, 2023Updated 2 years ago
- β11May 17, 2021Updated 5 years ago
- An implementation of the FuzzyDBSCAN algorithm.β13Nov 6, 2022Updated 3 years ago
- Experiments studying ensemble methods for stock portfolio selectionβ14Oct 4, 2017Updated 8 years ago
- Visualization of many Clustering Algorithms, via Notebook or GUIβ24Apr 13, 2026Updated last month
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ70Feb 22, 2023Updated 3 years ago
- A comprehensive evaluation framework for the SEA regionβ27Apr 20, 2026Updated last month
- DSWE on the GEEβ10Nov 4, 2020Updated 5 years ago
- DNA Microarray Gene Expression Data Classification Using SVM and MLP with Feature Selection Methods Relief and LASSOβ29Jun 25, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Kaggleμ μ²μ μ νλ μ¬λλ€μ μν λ¬Έμβ10Jan 7, 2021Updated 5 years ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set.β54Sep 3, 2024Updated last year
- A platform for image algorithm validation in public challenges.β16Jan 13, 2017Updated 9 years ago
- Code of the paper Fair k-Means Clusteringβ14Oct 30, 2021Updated 4 years ago
- A curated list of Federated Learning papers/articles and recent advancements.β105Feb 9, 2026Updated 3 months ago
- Abstract BusinessObject for StromDAO Energy Blockchain. Abstraction layer between blockchain technology and business logic providing enerβ¦β10Dec 6, 2022Updated 3 years ago
- [IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questionsβ314Nov 3, 2023Updated 2 years ago