statice / awesome-synthetic-data
A curated list of awesome synthetic data tools (open source and commercial).
β153Updated last year
Alternatives and similar repositories for awesome-synthetic-data:
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
- π A curated list of resources dedicated to synthetic dataβ125Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models.β175Updated 11 months ago
- π A curated list of papers & technical articles on AI Quality & Safetyβ168Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ145Updated 10 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β100Updated last week
- Framework for building data agent workflowsβ83Updated 6 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.β85Updated 2 weeks ago
- How far can we go with an LLM for a classification problemβ24Updated 2 months ago
- Synthetic Data SDK β¨β201Updated this week
- β141Updated 7 months ago
- all code examples in the blog postsβ24Updated 3 weeks ago
- π¦π― Flex those feathers!β239Updated 3 months ago
- β76Updated 4 months ago
- β46Updated 8 months ago
- A project that enables identification and classification of an intent of a message with dynamic labelsβ34Updated 2 months ago
- β17Updated last month
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ244Updated 3 weeks ago
- Testing and evaluation framework for voice agentsβ92Updated this week
- The easiest and most comprehensive framework for building enterprise-grade NL2SQL solutions at scale.β37Updated 2 months ago
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podcβ¦β65Updated this week
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.β106Updated 5 months ago
- β70Updated 4 months ago
- β69Updated 10 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β124Updated last year
- A Lightweight Library for AI Observabilityβ233Updated this week
- nbsynthetic is simple and robust tabular synthetic data generation library for small and medium size datasetsβ64Updated last year
- A curated list of awesome resources for creating synthetic dataβ41Updated 3 years ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ296Updated 3 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β72Updated 9 months ago