gretelai / awesome-synthetic-dataLinks
π A curated list of resources dedicated to synthetic data
β136Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
Sorting:
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β236Updated 2 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ68Updated 2 years ago
- A curated list of awesome synthetic data tools (open source and commercial).β211Updated last year
- Public blueprints for data use casesβ84Updated 3 weeks ago
- Metrics to evaluate quality and efficacy of synthetic datasets.β248Updated this week
- Public repository holding examples for dataheroes libraryβ24Updated 4 months ago
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podcβ¦β85Updated this week
- This is an open-source tool to assess and improve the trustworthiness of AI systems.β99Updated last month
- Fiddler Auditor is a tool to evaluate language models.β188Updated last year
- TalkToModel gives anyone with the powers of XAI through natural language conversations π¬!β124Updated 2 years ago
- A library of Reversible Data Transformsβ128Updated this week
- A novel approach for synthesizing tabular data using pretrained large language modelsβ322Updated 3 months ago
- Introduction to Data-Centric AI, MIT IAP 2024 π€β103Updated 3 months ago
- A curated list of awesome resources for creating synthetic dataβ43Updated 3 years ago
- Identify bias and measure fairness of your dataβ96Updated this week
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Actβ93Updated last year
- Open-Source Software, Tutorials, and Research on Data-Centric AI π€β339Updated last year
- A Natural Language Interface to Explainable Boosting Machinesβ68Updated last year
- π A curated list of papers & technical articles on AI Quality & Safetyβ193Updated 5 months ago
- Foundation Models for Data Tasksβ109Updated 2 years ago
- Interpret text data using LLMs (scikit-learn compatible).β170Updated 2 weeks ago
- β267Updated 8 months ago
- openclean - Data Cleaning and data profiling library for Pythonβ82Updated 3 years ago
- Benchmarking synthetic data generation methods.β279Updated last week
- Framework for building and maintaining self-updating prompts for LLMsβ64Updated last year
- ReLM is a Regular Expression engine for Language Modelsβ106Updated 2 years ago
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automatβ¦β170Updated 9 months ago
- AI Data Management & Evaluation Platformβ216Updated 2 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.β44Updated 7 months ago
- SPEAR: Programmatically label and build training data quickly.β108Updated last year