gretelai / awesome-synthetic-data
A curated list of resources dedicated to synthetic data
★118 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for awesome-synthetic-data
- A curated list of awesome synthetic data tools (open source and commercial). ★104 · Updated 9 months ago
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the … ★22 · Updated 2 years ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation. ★212 · Updated 3 weeks ago
- Public blueprints for data use cases ★71 · Updated this week
- The Gretel Python Client allows you to interact with the Gretel REST API. ★53 · Updated this week
- A curated list of awesome resources for creating synthetic data ★39 · Updated 2 years ago
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs. ★29 · Updated last month
- Metrics to evaluate quality and efficacy of synthetic datasets. ★211 · Updated this week
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ★590 · Updated last week
- Federated Learning Utilities and Tools for Experimentation ★185 · Updated 9 months ago
- Fiddler Auditor is a tool to evaluate language models. ★171 · Updated 7 months ago
- nbsynthetic is a simple and robust tabular synthetic data generation library for small and medium-sized datasets ★62 · Updated last year
- Generative models to automatically anonymize data to meet GDPR & CCPA standards. ★30 · Updated last year
- A library of Reversible Data Transforms ★121 · Updated this week
- Where Gretel published notebooks and code for blog posts ★19 · Updated last year
- A Natural Language Interface to Explainable Boosting Machines ★60 · Updated 4 months ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set. ★47 · Updated 2 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ★92 · Updated last year
- ★20 · Updated last year
- Benchmarking synthetic data generation methods. ★262 · Updated this week
- Trade any tensors over the network ★30 · Updated last year
- Code for paper: "Privately generating tabular data using language models". ★14 · Updated last year
- Official mirror of Python-FHEz; Python Fully Homomorphic Encryption (FHE) Library for Encrypted Deep Learning as a Service (EDLaaS). ★28 · Updated 2 years ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems. ★78 · Updated this week
- Editing machine learning models to reflect human knowledge and values ★123 · Updated last year
- A novel approach for synthesizing tabular data using pretrained large language models ★281 · Updated last week
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, newsletters, principles, podcasts, reports, to… ★52 · Updated this week
- Privacy-Preserving Machine Learning (PPML) Tutorial ★37 · Updated 5 months ago
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques. ★29 · Updated this week
- Client interface for all things Cleanlab Studio ★27 · Updated this week