gretelai / awesome-synthetic-data
📖 A curated list of resources dedicated to synthetic data
☆126Updated 2 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
Sorting:
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the …☆23Updated 3 years ago
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.☆28Updated 2 months ago
- Public blueprints for data use cases☆76Updated last week
- A curated list of awesome resources for creating synthetic data☆42Updated 3 years ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.☆226Updated 2 months ago
- The Gretel Python Client allows you to interact with the Gretel REST API.☆55Updated last week
- A curated list of awesome synthetic data tools (open source and commercial).☆179Updated last year
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆94Updated last year
- Generative models to automatically anonymize data to meet GDPR & CCPA standards.☆31Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models.☆179Updated last year
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆637Updated last month
- A novel approach for synthesizing tabular data using pretrained large language models☆310Updated this week
- SDNist: Benchmark data and evaluation tools for data synthesizers.☆35Updated last month
- This is an open-source tool to assess and improve the trustworthiness of AI systems.☆90Updated this week
- Where Gretel published notebooks and code for blog posts☆18Updated last year
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podc…☆71Updated last week
- Metrics to evaluate quality and efficacy of synthetic datasets.☆233Updated 3 weeks ago
- A Natural Language Interface to Explainable Boosting Machines☆66Updated 10 months ago
- Benchmarking synthetic data generation methods.☆273Updated last week
- Data for the Chat With Your Data benchmark.☆137Updated last year
- Research on Tabular Foundation Models☆49Updated 5 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasets☆66Updated 2 years ago
- Differentially-private transformers using HuggingFace and Opacus☆138Updated 8 months ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated 3 months ago
- ☆267Updated 3 months ago
- Private Evolution: Generating DP Synthetic Data without Training [ICLR 2024, ICML 2024 Spotlight]☆95Updated this week
- Code for paper: "Privately generating tabular data using language models".☆15Updated last year
- A library of Reversible Data Transforms☆124Updated last week
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Updated 7 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year