argilla-io / synthetic-data-generatorLinks
Build datasets using natural language
β529Updated 2 weeks ago
Alternatives and similar repositories for synthetic-data-generator
Users that are interested in synthetic-data-generator are comparing it to the libraries listed below
Sorting:
- π€ Benchmark Large Language Models Reliably On Your Dataβ398Updated this week
- A Lightweight Library for AI Observabilityβ251Updated 7 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β336Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalizationβ275Updated last year
- Automatically evaluate your LLMs in Google Colabβ661Updated last year
- awesome synthetic (text) datasetsβ297Updated 3 months ago
- π Automatically annotate papers using LLMsβ355Updated 5 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!β451Updated last year
- Simple UI for debugging correlations of text embeddingsβ292Updated 4 months ago
- β264Updated 3 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.β750Updated 4 months ago
- Create large-scale synthetic training data for model distillation and evaluationβ581Updated this week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β179Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ456Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β240Updated 11 months ago
- β682Updated 5 months ago
- Code for explaining and evaluating late chunking (chunked pooling)β452Updated 9 months ago
- β155Updated 5 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and β¦β344Updated last year
- An Open Source Toolkit For LLM Distillationβ732Updated 2 months ago
- One click templates for inferencing Language Modelsβ213Updated 2 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engineβ480Updated 2 months ago
- Tutorial for building LLM routerβ228Updated last year
- Solving data for LLMs - Create quality synthetic datasets!β151Updated 8 months ago
- An open-source tool for general prompt optimization.β637Updated last week
- β146Updated last year
- β232Updated 3 months ago
- This is the official repository for Auto-RAG.β225Updated 2 months ago
- Together Open Deep Researchβ352Updated 5 months ago
- Ranking LLMs on agentic tasksβ192Updated 3 weeks ago