statice / awesome-synthetic-data
A curated list of awesome synthetic data tools (open source and commercial).
☆133 · Updated last year
Alternatives and similar repositories for awesome-synthetic-data:
Users who are interested in awesome-synthetic-data are comparing it to the libraries listed below.
- A curated list of resources dedicated to synthetic data ☆123 · Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models. ☆174 · Updated 10 months ago
- Sample notebooks and prompts for LLM evaluation ☆119 · Updated last month
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆82 · Updated this week
- A curated list of papers & technical articles on AI Quality & Safety ☆166 · Updated last year
- Framework for building data agent workflows ☆84 · Updated 4 months ago
- A novel approach for synthesizing tabular data using pretrained large language models ☆295 · Updated 2 months ago
- A curated list of awesome resources for creating synthetic data ☆41 · Updated 2 years ago
- Automated knowledge graph creation SDK ☆119 · Updated last month
- Pebblo enables developers to safely load data and promote their Gen AI app to deployment ☆141 · Updated 2 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆145 · Updated 9 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆233 · Updated 3 months ago
- A notebook-based tutorial series on building an LLM from scratch ☆24 · Updated 4 months ago
- A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation. ☆218 · Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆104 · Updated 4 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆97 · Updated last month
- Flex those feathers! ☆236 · Updated 2 months ago
- CodeSage: Code Representation Learning At Scale (ICLR 2024) ☆89 · Updated 2 months ago
- Red-Teaming Language Models with DSPy ☆153 · Updated 9 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆79 · Updated 11 months ago
- Python SDK for running evaluations on LLM generated responses ☆253 · Updated last week
- Automatic Evals for Instruction-Tuned Models ☆100 · Updated this week
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators" ☆121 · Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆237 · Updated last month
- ☆137 · Updated 5 months ago
- ☆67 · Updated 2 months ago
- Data for the Chat With Your Data benchmark. ☆128 · Updated last year
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. ☆105 · Updated this week
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM. ☆294 · Updated last month
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated 11 months ago