argilla-io / synthetic-data-generatorLinks
Build datasets using natural language
☆507Updated 2 months ago
Alternatives and similar repositories for synthetic-data-generator
Users that are interested in synthetic-data-generator are comparing it to the libraries listed below
Sorting:
- 🤗 Benchmark Large Language Models Reliably On Your Data☆381Updated this week
- Generate large synthetic data using an LLM☆438Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆318Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆649Updated last year
- 📝 Automatically annotate papers using LLMs☆332Updated 3 months ago
- A Lightweight Library for AI Observability☆250Updated 5 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆747Updated 2 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆477Updated 2 weeks ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆262Updated 2 weeks ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆429Updated 11 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆427Updated 7 months ago
- Simple UI for debugging correlations of text embeddings☆288Updated 2 months ago
- Together Open Deep Research☆331Updated 3 months ago
- ☆677Updated 3 months ago
- Ranking LLMs on agentic tasks☆176Updated 3 weeks ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆824Updated this week
- An Open Source Toolkit For LLM Distillation☆698Updated 3 weeks ago
- ☆260Updated last month
- ☆155Updated 3 months ago
- Tool for generating high quality Synthetic datasets☆1,100Updated this week
- awesome synthetic (text) datasets☆291Updated last month
- This is the official repository for Auto-RAG.☆218Updated 2 weeks ago
- ☆222Updated last month
- An open-source tool for general prompt optimization.☆590Updated this week
- Tutorial for building LLM router☆221Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 6 months ago
- A flexible, adaptive classification system for dynamic text classification☆351Updated 2 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆230Updated 9 months ago
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library.☆421Updated this week