meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
☆1,427Updated last month
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for LLM prompt optimization.☆728Updated 3 weeks ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,089Updated this week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,447Updated 6 months ago
- Synthetic data curation for post-training and structured data extraction☆1,577Updated 4 months ago
- Build datasets using natural language☆552Updated 3 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆419Updated this week
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,625Updated this week
- ☆1,252Updated 2 months ago
- An interface library for RL post training with environments.☆848Updated last week
- Fast State-of-the-Art Static Embeddings☆1,956Updated last month
- ☆693Updated 7 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆1,080Updated this week
- GenAI Processors is a lightweight Python library that enables efficient, parallel content processing.☆2,015Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆834Updated 10 months ago
- Implementing the 4 agentic patterns from scratch☆1,650Updated 9 months ago
- Open source project for data preparation for GenAI applications☆867Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆2,007Updated 3 weeks ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,404Updated 7 months ago
- Curate High Quality Datasets, Train, Evaluate and Ship! 🚀☆676Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,810Updated last month
- Our library for RL environments + evals☆3,655Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,202Updated last week
- 📝 Automatically annotate papers using LLMs☆391Updated 3 weeks ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions☆1,158Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,971Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,177Updated 10 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,314Updated 10 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆926Updated last week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,152Updated last week
- OctoTools: An agentic framework with extensible tools for complex reasoning☆1,403Updated 2 months ago