joofio / awesome-data-synthesis
A curated list of awesome resources for creating synthetic data
β42Updated 3 years ago
Alternatives and similar repositories for awesome-data-synthesis:
Users that are interested in awesome-data-synthesis are comparing it to the libraries listed below
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set.β52Updated 8 months ago
- π A curated list of resources dedicated to synthetic dataβ126Updated 2 years ago
- A library of Reversible Data Transformsβ124Updated this week
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β226Updated last month
- Metrics to evaluate quality and efficacy of synthetic datasets.β231Updated 3 weeks ago
- Benchmarking synthetic data generation methods.β273Updated this week
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.β36Updated last week
- Frouros: an open-source Python library for drift detection in machine learning systems.β215Updated 3 months ago
- A toolbox for differentially private data generationβ131Updated last year
- A novel approach for synthesizing tabular data using pretrained large language modelsβ311Updated 6 months ago
- Evaluate real and synthetic datasets against each otherβ88Updated 4 months ago
- This repo accompanies the FF22 research cycle focused on unsupervised methods for detecting concept driftβ29Updated 3 years ago
- A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPRβ87Updated 2 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ66Updated 2 years ago
- Privacy-Preserving Machine Learning (PPML) Tutorialβ38Updated 11 months ago
- Tools and service for differentially private processing of tabular and relational dataβ265Updated 3 months ago
- Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANsβ44Updated last year
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the β¦β23Updated 3 years ago
- UCLANesl - NIST Differential Privacy Challenge (Match 3)β24Updated 5 years ago
- Robust de-identification of medical notes using transformer architecturesβ52Updated 2 years ago
- Official git for "CTAB-GAN: Effective Table Data Synthesizing"β85Updated last year
- Official mirror of Python-FHEz; Python Fully Homomorphic Encryption (FHE) Library for Encrypted Deep Learning as a Service (EDLaaS).β29Updated 3 years ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β76Updated last year
- Differentially Private Synthetic Data Generation [DP-SDG] - Experimental Setups & Knowledge Base - WORK IN PROGRESSβ12Updated 2 years ago
- β39Updated 2 years ago
- A Natural Language Interface to Explainable Boosting Machinesβ66Updated 10 months ago
- Federated Learning Utilities and Tools for Experimentationβ189Updated last year
- Official GitHub for CTAB-GAN+β75Updated 11 months ago
- An open source automl library for using machine learning in healthcare.β118Updated last year
- Experiments on Tabular Data Modelsβ277Updated last year