statice / awesome-synthetic-data
A curated list of awesome synthetic data tools (open source and commercial).
β107Updated 10 months ago
Related projects β
Alternatives and complementary repositories for awesome-synthetic-data
- π A curated list of resources dedicated to synthetic dataβ118Updated 2 years ago
- Automated knowledge graph creation SDKβ113Updated 4 months ago
- A curated list of awesome resources for creating synthetic dataβ39Updated 2 years ago
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.β596Updated this week
- Fiddler Auditor is a tool to evaluate language models.β171Updated 8 months ago
- Sample notebooks and prompts for LLM evaluationβ114Updated last week
- Data for the Chat With Your Data benchmark.β126Updated 11 months ago
- Mistral + Haystack: build RAG pipelines that rock π€β100Updated 9 months ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.β212Updated this week
- Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.β29Updated this week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraphβ146Updated 7 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets.β212Updated this week
- π©π»βπ³ A collection of example notebooksβ378Updated this week
- π A curated list of papers & technical articles on AI Quality & Safetyβ161Updated last year
- Tutorial for building LLM routerβ163Updated 4 months ago
- Welcome to the Natural Language to SQL demo project using LlamaIndex! This application is designed to demonstrate the innovative use of Lβ¦β67Updated 7 months ago
- The Gretel Python Client allows you to interact with the Gretel REST API.β53Updated this week
- Integrating knowledge graphs (KG) with large language models (LLM)β76Updated 2 months ago
- Test LLMs automatically with Giskard and CI/CDβ28Updated 3 months ago
- Generative models to automatically anonymize data to meet GDPR & CCPA standards.β30Updated last year
- Notebooks and articles related to LLMsβ24Updated 10 months ago
- A novel approach for synthesizing tabular data using pretrained large language modelsβ284Updated 3 weeks ago
- Public blueprints for data use casesβ72Updated this week
- Open source repo for the WhyHow Knowledge Graph Studioβ36Updated this week
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicatiβ¦β222Updated last month
- β66Updated 6 months ago
- β64Updated 7 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwardsβ121Updated this week
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β79Updated 9 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ213Updated last week