A curated list of resources dedicated to synthetic data
☆142 · Jul 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below.
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs. ☆28 · Mar 5, 2025 · Updated last year
- Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phe… ☆33 · Jan 19, 2022 · Updated 4 years ago
- A curated list of awesome synthetic data tools (open source and commercial). ☆252 · Jan 11, 2024 · Updated 2 years ago
- Public blueprints for data use cases ☆85 · Sep 10, 2025 · Updated 7 months ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ☆677 · Jun 24, 2025 · Updated 10 months ago
- A curated list of awesome resources for creating synthetic data ☆45 · Feb 16, 2022 · Updated 4 years ago
- ☆14 · Jun 24, 2024 · Updated last year
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPR ☆101 · Apr 8, 2026 · Updated last month
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset. ☆23 · Jun 12, 2025 · Updated 10 months ago
- Supplemental Material for the ESANN 2019 Submission "Preserving privacy using synthetic data models and applications in health informatic… ☆20 · Mar 20, 2020 · Updated 6 years ago
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metrics ☆46 · Apr 4, 2023 · Updated 3 years ago
- LMQL implementation of tree of thoughts ☆36 · Jan 31, 2024 · Updated 2 years ago
- A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation. ☆659 · Apr 21, 2026 · Updated 2 weeks ago
- Curated LLM (ICML 2024) ☆14 · Oct 23, 2024 · Updated last year
- Tools and service for differentially private processing of tabular and relational data ☆294 · May 1, 2026 · Updated last week
- ☆52 · May 23, 2023 · Updated 2 years ago
- Synthetic data generation for tabular data ☆3,480 · Apr 24, 2026 · Updated 2 weeks ago
- Simple PyTorch Implementation of Physics Informed Neural Network (PINN) ☆13 · Mar 9, 2021 · Updated 5 years ago
- ☆14 · Mar 23, 2026 · Updated last month
- ☆13 · Aug 6, 2024 · Updated last year
- A faceted browser on top of RDF data available through SPARQL endpoints that support COUNT/GROUP BY queries ☆36 · Feb 10, 2014 · Updated 12 years ago
- ☆16 · May 12, 2023 · Updated 2 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets. ☆259 · Apr 13, 2026 · Updated 3 weeks ago
- ☆16 · Jun 17, 2024 · Updated last year
- This work combines differential privacy and multi-party computation protocol to achieve distributed machine learning. ☆27 · Oct 15, 2020 · Updated 5 years ago
- Dual Adversarial Autoencoder for Generating Set-valued Sequences ☆19 · Jan 15, 2021 · Updated 5 years ago
- A Data-Centric library providing a unified interface for state-of-the-art methods for hardness characterisation of data points. ☆25 · Mar 6, 2025 · Updated last year
- This is the implementation of the Recursive Nearest (Neighbor) Agglomeration ☆11 · Oct 9, 2020 · Updated 5 years ago
- ☆18 · Jul 15, 2024 · Updated last year
- [Robust cardiac MR image segmentation foundation model] This code contains the most powerful cardiac segmentation model trained from UK b… ☆14 · Jun 13, 2022 · Updated 3 years ago
- Visualization of many Clustering Algorithms, via Notebook or GUI ☆24 · Apr 13, 2026 · Updated 3 weeks ago
- OCR Engine ☆17 · Dec 31, 2021 · Updated 4 years ago
- A comprehensive evaluation framework for the SEA region ☆24 · Apr 20, 2026 · Updated 2 weeks ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro… ☆29 · Jul 7, 2022 · Updated 3 years ago
- Re-ranking task using MS MARCO dataset and Hugging Face library ☆15 · Jun 7, 2020 · Updated 5 years ago
- DNA Microarray Gene Expression Data Classification Using SVM and MLP with Feature Selection Methods Relief and LASSO ☆29 · Jun 25, 2019 · Updated 6 years ago
- PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development ☆126 · Jan 8, 2024 · Updated 2 years ago
- ☆19 · Apr 15, 2022 · Updated 4 years ago
- Documentation for people who are new to Kaggle ☆10 · Jan 7, 2021 · Updated 5 years ago