meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
☆878 Updated last week
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization. ☆331 Updated last week
- Benchmark Large Language Models Reliably On Your Data ☆315 Updated this week
- Fast State-of-the-Art Static Embeddings ☆1,688 Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,385 Updated 4 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection ☆598 Updated last week
- A collection of notebooks/recipes showcasing use cases of open-source models with Together AI. ☆920 Updated this week
- Build datasets using natural language ☆479 Updated 3 weeks ago
- ☆629 Updated this week
- Synthetic data curation for post-training and structured data extraction ☆1,364 Updated this week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data ☆1,424 Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆787 Updated 4 months ago
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated … ☆1,231 Updated 2 weeks ago
- Kickstart your LLMOps initiative with a flexible, robust, and productive Python package. ☆871 Updated 3 months ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,117 Updated last month
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ☆1,461 Updated 3 weeks ago
- Automatically evaluate your LLMs in Google Colab ☆629 Updated last year
- ☆1,735 Updated this week
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆489 Updated 3 months ago
- ☆656 Updated last month
- Large Concept Models: Language modeling in a sentence representation space ☆2,206 Updated 4 months ago
- Implementing the 4 agentic patterns from scratch ☆1,338 Updated 2 months ago
- GenAI powered multi-agentic medical diagnostics and healthcare research assistance chatbot. Designed for healthcare professionals, r… ☆449 Updated 3 weeks ago
- A system for agentic LLM-powered data processing and ETL ☆1,987 Updated last week
- Automatically annotate papers using LLMs ☆321 Updated last month
- The NVIDIA Agent Intelligence (AIQ) toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. ☆972 Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,563 Updated last week
- A reading list on LLM based Synthetic Data Generation ☆1,280 Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,436 Updated this week
- Together Open Deep Research ☆298 Updated last month
- Framework for enhancing LLMs for RAG tasks using fine-tuning. ☆739 Updated last week