meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
☆1,081Updated last week
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for general prompt optimization.☆576Updated this week
- ☆677Updated 3 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,498Updated 6 months ago
- Build datasets using natural language☆505Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆367Updated this week
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆824Updated this week
- Synthetic data curation for post-training and structured data extraction☆1,464Updated 3 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,782Updated this week
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆987Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,540Updated last week
- Generate large synthetic data using an LLM☆438Updated this week
- ☆1,927Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆806Updated 6 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆925Updated 6 months ago
- Open source project for data preparation for GenAI applications☆754Updated this week
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG☆342Updated 4 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,149Updated this week
- Implementing the 4 agentic patterns from scratch☆1,463Updated 4 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,183Updated 3 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆814Updated this week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,351Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆1,621Updated this week
- 📝 Automatically annotate papers using LLMs☆332Updated 3 months ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions☆1,100Updated last month
- Big & Small LLMs working together☆1,088Updated this week
- 100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.☆2,697Updated last week
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,131Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,766Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,451Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆260Updated 2 weeks ago