meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
β1,010Updated this week
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for general prompt optimization.β557Updated this week
- π€ Benchmark Large Language Models Reliably On Your Dataβ354Updated last week
- β673Updated 2 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.β970Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,507Updated last week
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detectionβ772Updated this week
- Synthetic data curation for post-training and structured data extractionβ1,434Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,456Updated 6 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β802Updated 5 months ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAGβ1,332Updated last month
- Build datasets using natural languageβ498Updated 2 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.β1,098Updated this week
- GenAI Processors is a lightweight Python library that enables efficient, parallel content processing.β108Updated last week
- β679Updated this week
- Fast State-of-the-Art Static Embeddingsβ1,752Updated last month
- β1,857Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,147Updated 2 months ago
- Generate large synthetic data using an LLMβ432Updated this week
- π Automatically annotate papers using LLMsβ328Updated 2 months ago
- Implementing the 4 agentic patterns from scratchβ1,413Updated 3 months ago
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAGβ335Updated 3 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desigβ¦β923Updated 5 months ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactionsβ1,084Updated last week
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated β¦β1,328Updated 3 weeks ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,123Updated 5 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,441Updated last month
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β315Updated last month
- Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.β876Updated 4 months ago
- Readymade evaluators for your LLM appsβ624Updated 3 weeks ago
- β679Updated 3 weeks ago