meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
☆1,379Updated 2 weeks ago
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for LLM prompt optimization.☆703Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,882Updated last month
- Build datasets using natural language☆543Updated last month
- ☆1,183Updated last month
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,431Updated 5 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆411Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,547Updated 3 months ago
- Fast State-of-the-Art Static Embeddings☆1,898Updated last month
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,059Updated this week
- ☆687Updated 6 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆1,071Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆828Updated 9 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,302Updated 9 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,078Updated last week
- Training Model Behavior in Agentic Systems☆651Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,349Updated 6 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,927Updated this week
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution☆1,526Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆473Updated 2 months ago
- Implementing the 4 agentic patterns from scratch☆1,618Updated 7 months ago
- An interface library for RL post training with environments.☆628Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,776Updated 2 weeks ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions☆1,142Updated 4 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,496Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,510Updated 5 months ago
- On the Theoretical Limitations of Embedding-Based Retrieval☆591Updated last month
- An Open Source Toolkit For LLM Distillation☆777Updated 4 months ago
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs☆1,859Updated last month
- 📝 Automatically annotate papers using LLMs☆359Updated 6 months ago
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,056Updated last week