argilla-io / synthetic-data-generator
Build datasets using natural language
☆548 Updated 2 months ago
Alternatives and similar repositories for synthetic-data-generator
Users who are interested in synthetic-data-generator compare it to the libraries listed below.
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆414 Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆343 Updated 6 months ago
- Automatically evaluate your LLMs in Google Colab ☆673 Updated last year
- awesome synthetic (text) datasets ☆314 Updated 2 weeks ago
- Train LLM Model Behavior ☆667 Updated this week
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge! ☆458 Updated last year
- Simple UI for debugging correlations of text embeddings ☆302 Updated 6 months ago
- A Lightweight Library for AI Observability ☆252 Updated 9 months ago
- An open-source tool for LLM prompt optimization. ☆717 Updated last week
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆275 Updated last year
- Code for explaining and evaluating late chunking (chunked pooling) ☆470 Updated 11 months ago
- Fast Semantic Text Deduplication & Filtering ☆852 Updated last month
- 📝 Automatically annotate papers using LLMs ☆391 Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆479 Updated 3 months ago
- Tutorial for building LLM router ☆236 Updated last year
- An Open Source Toolkit For LLM Distillation ☆787 Updated 4 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆488 Updated 4 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆758 Updated 6 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆251 Updated 4 months ago
- A flexible, adaptive classification system for dynamic text classification ☆510 Updated 2 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆832 Updated 10 months ago
- Together Open Deep Research ☆355 Updated 7 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆346 Updated last year
- An Awesome list of curated DSPy resources. ☆481 Updated 2 months ago
- Tool for generating high quality Synthetic datasets ☆1,411 Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆274 Updated 4 months ago