argilla-io / synthetic-data-generatorLinks
Build datasets using natural language
☆518Updated 3 months ago
Alternatives and similar repositories for synthetic-data-generator
Users that are interested in synthetic-data-generator are comparing it to the libraries listed below
Sorting:
- 🤗 Benchmark Large Language Models Reliably On Your Data☆387Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆326Updated 2 months ago
- Generate large synthetic data using an LLM☆441Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆398Updated 2 weeks ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆266Updated last month
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆477Updated last month
- ☆262Updated 2 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆440Updated 8 months ago
- awesome synthetic (text) datasets☆293Updated last month
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Automatically evaluate your LLMs in Google Colab☆656Updated last year
- A Lightweight Library for AI Observability☆250Updated 6 months ago
- ☆229Updated last month
- Tool for generating high quality Synthetic datasets☆1,139Updated 3 weeks ago
- 📝 Automatically annotate papers using LLMs☆343Updated 4 months ago
- Fast Semantic Text Deduplication & Filtering☆795Updated 2 weeks ago
- One click templates for inferencing Language Models☆211Updated 3 weeks ago
- Together Open Deep Research☆338Updated 4 months ago
- ☆155Updated 4 months ago
- ☆678Updated 3 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆436Updated 11 months ago
- Simple UI for debugging correlations of text embeddings☆289Updated 2 months ago
- An Open Source Toolkit For LLM Distillation☆717Updated last month
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆747Updated 3 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆236Updated 9 months ago
- From data to vector database effortlessly☆79Updated 3 months ago
- A flexible, adaptive classification system for dynamic text classification☆424Updated this week
- ☆180Updated 6 months ago
- This is the official repository for Auto-RAG.☆218Updated last month
- An open-source tool for general prompt optimization.☆606Updated this week