gretelai / awesome-synthetic-dataLinks
π A curated list of resources dedicated to synthetic data
β140Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β242Updated 2 weeks ago
- A curated list of awesome synthetic data tools (open source and commercial).β231Updated 2 years ago
- Public blueprints for data use casesβ85Updated 4 months ago
- Fiddler Auditor is a tool to evaluate language models.β188Updated last year
- π A curated list of papers & technical articles on AI Quality & Safetyβ199Updated 9 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ69Updated 2 years ago
- TalkToModel gives anyone with the powers of XAI through natural language conversations π¬!β126Updated 2 years ago
- Introduction to Data-Centric AI, MIT IAP 2024 π€β105Updated 6 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Actβ93Updated 2 years ago
- Identify bias and measure fairness of your dataβ95Updated last month
- Metrics to evaluate quality and efficacy of synthetic datasets.β256Updated this week
- Foundation Models for Data Tasksβ110Updated 2 years ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems.β100Updated last month
- A curated list of awesome resources for creating synthetic dataβ44Updated 3 years ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ339Updated last month
- Notebooks demonstrating example applications of the cleanlabΒ libraryβ132Updated last month
- Open-Source Software, Tutorials, and Research on Data-Centric AI π€β344Updated 2 years ago
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI.β122Updated last year
- Framework for building and maintaining self-updating prompts for LLMsβ65Updated last year
- The Foundation Model Transparency Indexβ85Updated last month
- A curated list of awesome academic research, books, code of ethics, courses, databases, data sets, frameworks, institutes, maturity modeβ¦β109Updated last week
- ReLM is a Regular Expression engine for Language Modelsβ107Updated 2 years ago
- A library of Reversible Data Transformsβ131Updated this week
- SPEAR: Programmatically label and build training data quickly.β109Updated last year
- β271Updated 11 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β25Updated 3 years ago
- π€ Disaggregators: Curated data labelers for in-depth analysis.β67Updated 2 years ago
- A Natural Language Interface to Explainable Boosting Machinesβ69Updated last year
- β22Updated 2 years ago
- Benchmarking synthetic data generation methods.β297Updated last week