gretelai / awesome-synthetic-dataLinks
π A curated list of resources dedicated to synthetic data
β132Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β235Updated 3 weeks ago
- A curated list of awesome synthetic data tools (open source and commercial).β197Updated last year
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ68Updated 2 years ago
- Public blueprints for data use casesβ81Updated 2 weeks ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Actβ94Updated last year
- A curated list of awesome resources for creating synthetic dataβ43Updated 3 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.β243Updated last week
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podcβ¦β78Updated this week
- Open-Source Software, Tutorials, and Research on Data-Centric AI π€β338Updated last year
- Fiddler Auditor is a tool to evaluate language models.β184Updated last year
- Introduction to Data-Centric AI, MIT IAP 2023 π€β102Updated last month
- π A curated list of papers & technical articles on AI Quality & Safetyβ188Updated 3 months ago
- Foundation Models for Data Tasksβ108Updated 2 years ago
- A library of Reversible Data Transformsβ127Updated this week
- Benchmarking synthetic data generation methods.β275Updated this week
- TalkToModel gives anyone with the powers of XAI through natural language conversations π¬!β121Updated 2 years ago
- An open-source compliance-centered evaluation framework for Generative AI modelsβ159Updated this week
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.β82Updated 2 months ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems.β93Updated 3 weeks ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ317Updated last month
- A Natural Language Interface to Explainable Boosting Machinesβ68Updated last year
- Library for creating causal chains using language models.β79Updated 2 years ago
- A Python library for rapid prototyping, experimenting, and logging of federated learning using state-of-the-art models and datasets. Builβ¦β42Updated 11 months ago
- Interpret text data using LLMs (scikit-learn compatible).β169Updated this week
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β116Updated last year
- SPEAR: Programmatically label and build training data quickly.β107Updated last year
- The Gretel Python Client allows you to interact with the Gretel REST API.β56Updated 2 weeks ago
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the β¦β23Updated 3 years ago
- β87Updated last year
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automatβ¦β167Updated 7 months ago