meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
β1,400Updated last month
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for LLM prompt optimization.β711Updated 3 weeks ago
- π€ Benchmark Large Language Models Reliably On Your Dataβ412Updated this week
- Synthetic data curation for post-training and structured data extractionβ1,564Updated 4 months ago
- β692Updated 7 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detectionβ1,075Updated last week
- Build datasets using natural languageβ547Updated 2 months ago
- An interface library for RL post training with environments.β753Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,965Updated last month
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.β1,549Updated this week
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.β1,078Updated this week
- Train LLM Model Behaviorβ662Updated this week
- β1,242Updated last month
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAGβ1,437Updated 6 months ago
- Fast State-of-the-Art Static Embeddingsβ1,917Updated 2 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,141Updated last week
- Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.β886Updated 9 months ago
- β2,078Updated last week
- Open source project for data preparation for GenAI applicationsβ858Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,956Updated this week
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactionsβ1,151Updated 5 months ago
- An Open Source Toolkit For LLM Distillationβ785Updated 4 months ago
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accβ¦β1,355Updated last week
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.β¦β451Updated 8 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS modelsβ479Updated 3 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,172Updated 10 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,794Updated last month
- Automatically evaluate your LLMs in Google Colabβ671Updated last year
- Optimize prompts, code, and more with AI-powered Reflective Text Evolutionβ1,698Updated 2 weeks ago
- β1,168Updated last month
- Implementing the 4 agentic patterns from scratchβ1,638Updated 8 months ago