statice / awesome-synthetic-dataLinks
A curated list of awesome synthetic data tools (open source and commercial).
☆186Updated last year
Alternatives and similar repositories for awesome-synthetic-data
Users that are interested in awesome-synthetic-data are comparing it to the libraries listed below
Sorting:
- 📖 A curated list of resources dedicated to synthetic data☆131Updated 2 years ago
- This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation fr…☆17Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆270Updated 3 weeks ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- ☆72Updated last year
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆112Updated this week
- Python SDK for running evaluations on LLM generated responses☆286Updated 3 weeks ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆122Updated last month
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.☆49Updated 2 months ago
- Fiddler Auditor is a tool to evaluate language models.☆183Updated last year
- Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed…☆35Updated 3 weeks ago
- A novel approach for synthesizing tabular data using pretrained large language models☆310Updated last month
- ☆20Updated 5 months ago
- Data for the Chat With Your Data benchmark.☆139Updated last year
- ☆72Updated 8 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆115Updated 4 months ago
- ☆144Updated 11 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆132Updated 2 weeks ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆104Updated 6 months ago
- Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data☆398Updated 6 months ago
- Synthetic Data SDK ✨☆571Updated last week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆106Updated 2 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆216Updated last week
- Sample notebooks and prompts for LLM evaluation☆135Updated 2 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- A project that enables identification and classification of an intent of a message with dynamic labels☆41Updated 6 months ago
- Framework for building data agent workflows☆82Updated 10 months ago
- A curated list of awesome resources for creating synthetic data☆42Updated 3 years ago
- Client interface to Cleanlab Studio and the Trustworthy Language Model☆32Updated 4 months ago