gretelai / awesome-synthetic-data
š A curated list of resources dedicated to synthetic data
ā118Updated 2 years ago
Related projects ā
Alternatives and complementary repositories for awesome-synthetic-data
- Use FastCUT with public map images and location data from a few cities to generate realistic synthetic location data for any city in the ā¦ā22Updated 2 years ago
- A curated list of awesome synthetic data tools (open source and commercial).ā107Updated 10 months ago
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.ā29Updated this week
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.ā212Updated this week
- The Gretel Python Client allows you to interact with the Gretel REST API.ā53Updated this week
- Fiddler Auditor is a tool to evaluate language models.ā171Updated 8 months ago
- Public blueprints for data use casesā72Updated this week
- Metrics to evaluate quality and efficacy of synthetic datasets.ā212Updated this week
- Generative models to automatically anonymize data to meet GDPR & CCPA standards.ā30Updated last year
- Where Gretel published notebooks and code for blog postsā19Updated last year
- A curated list of awesome resources for creating synthetic dataā39Updated 2 years ago
- Framework for building and maintaining self-updating prompts for LLMsā59Updated 5 months ago
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsā63Updated last year
- A library of Reversible Data Transformsā121Updated this week
- ā28Updated last year
- Leverage your LangChain trace data for fine tuningā38Updated 3 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Actā92Updated last year
- A Natural Language Interface to Explainable Boosting Machinesā60Updated 4 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iā¦ā19Updated 2 years ago
- A novel approach for synthesizing tabular data using pretrained large language modelsā286Updated 3 weeks ago
- An open-source compliance-centered evaluation framework for Generative AI modelsā106Updated last week
- This is an open-source tool to assess and improve the trustworthiness of AI systems.ā80Updated this week
- ā75Updated 5 months ago
- Tabular In-Context Learningā26Updated last month
- Privacy-Preserving Machine Learning (PPML) Tutorialā37Updated 5 months ago
- Benchmarking synthetic data generation methods.ā262Updated this week
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.ā89Updated this week
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichmentā33Updated last year
- ā20Updated last year
- Drift detection module for machine learning pipelines.ā21Updated last year