gretelai / awesome-synthetic-data
A curated list of resources dedicated to synthetic data
☆137 · Updated 3 years ago
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation. ☆237 · Updated 3 months ago
- nbsynthetic is a simple and robust tabular synthetic data generation library for small and medium-sized datasets. ☆69 · Updated 2 years ago
- A curated list of awesome resources for creating synthetic data. ☆44 · Updated 3 years ago
- A curated list of awesome synthetic data tools (open source and commercial). ☆217 · Updated last year
- Metrics to evaluate quality and efficacy of synthetic datasets. ☆251 · Updated this week
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act. ☆93 · Updated 2 years ago
- A library of Reversible Data Transforms. ☆128 · Updated this week
- TalkToModel gives anyone the powers of XAI through natural language conversations! ☆125 · Updated 2 years ago
- Introduction to Data-Centric AI, MIT IAP 2024. ☆103 · Updated 4 months ago
- Public blueprints for data use cases. ☆85 · Updated last month
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package. ☆20 · Updated last year
- Open-Source Software, Tutorials, and Research on Data-Centric AI. ☆342 · Updated last year
- Benchmarking synthetic data generation methods. ☆281 · Updated last week
- A Natural Language Interface to Explainable Boosting Machines. ☆68 · Updated last year
- A novel approach for synthesizing tabular data using pretrained large language models. ☆324 · Updated 4 months ago
- A curated list of awesome academic research, books, code of ethics, courses, databases, data sets, frameworks, institutes, maturity mode… ☆87 · Updated this week
- A curated list of papers & technical articles on AI Quality & Safety. ☆192 · Updated 6 months ago
- This is an open-source tool to assess and improve the trustworthiness of AI systems. ☆99 · Updated last month
- Public repository holding examples for dataheroes library. ☆24 · Updated 4 months ago
- Fiddler Auditor is a tool to evaluate language models. ☆188 · Updated last year
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat… ☆172 · Updated 10 months ago
- The Foundation Model Transparency Index. ☆83 · Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆155 · Updated 2 weeks ago
- This repository provides a curated list of references about Machine Learning Model Governance, Ethics, and Responsible AI. ☆118 · Updated last year
- ☆172 · Updated last week
- Cross-field empirical trends analysis of XAI literature. ☆21 · Updated 2 years ago
- Interpret text data with LLMs (sklearn compatible). ☆171 · Updated 3 weeks ago
- ☆268 · Updated 9 months ago
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the … ☆23 · Updated 3 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning. ☆663 · Updated 4 months ago