gretelai / awesome-synthetic-data
π A curated list of resources dedicated to synthetic data
β125Updated 2 years ago
Alternatives and similar repositories for awesome-synthetic-data:
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
- A curated list of awesome synthetic data tools (open source and commercial).β153Updated last year
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.β29Updated last week
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the β¦β23Updated 3 years ago
- A curated list of awesome resources for creating synthetic dataβ41Updated 3 years ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β221Updated 2 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ64Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets.β222Updated this week
- Benchmarking synthetic data generation methods.β267Updated this week
- Fiddler Auditor is a tool to evaluate language models.β175Updated 11 months ago
- Public blueprints for data use casesβ74Updated this week
- A library of Reversible Data Transformsβ123Updated this week
- Where Gretel published notebooks and code for blog postsβ19Updated last year
- A novel approach for synthesizing tabular data using pretrained large language modelsβ296Updated 3 months ago
- Generative models to automatically anonymize data to meet GDPR & CCPA standards.β31Updated 2 years ago
- The Gretel Python Client allows you to interact with the Gretel REST API.β53Updated this week
- A Natural Language Interface to Explainable Boosting Machinesβ64Updated 7 months ago
- TalkToModel gives anyone with the powers of XAI through natural language conversations π¬!β120Updated last year
- Experimental library integrating LLM capabilities to support causal analysesβ106Updated 5 months ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word predβ¦β88Updated 6 months ago
- Introduction to Data-Centric AI, MIT IAP 2023 π€β98Updated last week
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podcβ¦β65Updated this week
- Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024 Spotlight]β88Updated last week
- Federated Learning Utilities and Tools for Experimentationβ187Updated last year
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β108Updated 10 months ago
- A toolbox for differentially private data generationβ129Updated last year
- Synthetic data generation by a Variational AutoEncoder with Differential Privacy assessed using Synthetic Data Vault metricsβ45Updated last year
- Differentially-private transformers using HuggingFace and Opacusβ132Updated 5 months ago
- SDNist: Benchmark data and evaluation tools for data synthesizers.β34Updated last week
- π A curated list of papers & technical articles on AI Quality & Safetyβ169Updated last year