argilla-io / synthetic-data-generatorLinks
Build datasets using natural language
β556Updated 3 months ago
Alternatives and similar repositories for synthetic-data-generator
Users that are interested in synthetic-data-generator are comparing it to the libraries listed below
Sorting:
- π€ Benchmark Large Language Models Reliably On Your Dataβ419Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β348Updated 6 months ago
- Curate High Quality Datasets, Train, Evaluate and Ship! πβ753Updated this week
- awesome synthetic (text) datasetsβ315Updated last month
- Automatically evaluate your LLMs in Google Colabβ677Updated last year
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engineβ491Updated 5 months ago
- A Lightweight Library for AI Observabilityβ252Updated 10 months ago
- Simple UI for debugging correlations of text embeddingsβ306Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling)β476Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalizationβ275Updated last year
- A flexible, adaptive classification system for dynamic text classificationβ517Updated 2 months ago
- β235Updated last month
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!β461Updated last year
- An open-source tool for LLM prompt optimization.β734Updated last week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ486Updated 4 months ago
- Tutorial for building LLM routerβ239Updated last year
- An Open Source Toolkit For LLM Distillationβ814Updated last week
- Together Open Deep Researchβ356Updated 8 months ago
- β267Updated 6 months ago
- Solving data for LLMs - Create quality synthetic datasets!β150Updated 11 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β181Updated last year
- This is the official repository for Auto-RAG.β231Updated 5 months ago
- β693Updated 7 months ago
- Fast Semantic Text Deduplication & Filteringβ859Updated 2 months ago
- A small library of LLM judgesβ309Updated 4 months ago
- LettuceDetect is a hallucination detection framework for RAG applications.β520Updated 3 months ago
- Structured information extraction from documentsβ319Updated last year
- Framework for enhancing LLMs for RAG tasks using fine-tuning.β759Updated last week
- An Awesome list of curated DSPy resources.β492Updated 2 weeks ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β835Updated 10 months ago