meta-llama / synthetic-data-kit
Tool for generating high quality Synthetic datasets
β156Updated this week
Alternatives and similar repositories for synthetic-data-kit:
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.β63Updated this week
- π€ Benchmark Large Language Models Reliably On Your Dataβ281Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β776Updated 3 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β277Updated last week
- The NVIDIA Agent Intelligence Toolkit (AIQ Toolkit) is an open-source library for efficiently connecting and optimizing teams of AI agentβ¦β764Updated this week
- β582Updated this week
- Generate large synthetic data using an LLMβ412Updated this week
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.β890Updated this week
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.β235Updated 3 weeks ago
- π Automatically annotate papers using LLMsβ318Updated 2 weeks ago
- βοΈGenAI powered multi-agentic medical diagnostics and healthcare research assistance chatbot. π₯ Designed for healthcare professionals, rβ¦β389Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β900Updated this week
- Build datasets using natural languageβ465Updated 2 months ago
- A Lightweight Library for AI Observabilityβ243Updated 2 months ago
- Readymade evaluators for your LLM appsβ356Updated this week
- Tutorial for building LLM routerβ198Updated 9 months ago
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAGβ321Updated last month
- A modern template for agentic orchestration β built for rapid iteration and scalable deployment using highly customizable, community-suppβ¦β349Updated this week
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.β304Updated last month
- β643Updated this week
- Readymade evaluators for agent trajectoriesβ183Updated this week
- Synthetic data curation for post-training and structured data extractionβ1,290Updated this week
- Implementing the 4 agentic patterns from scratchβ1,259Updated last month
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!β743Updated this week
- SwarmZero's SDK for building AI agents, swarms of agents and much more.β238Updated 2 months ago
- Dynamiq is an orchestration framework for agentic AI and LLM applicationsβ889Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,408Updated 2 weeks ago
- A non-official CLI for Llama Index Parserβ212Updated 9 months ago
- CodeScientist: An automated scientific discovery system for code-based experimentsβ237Updated last month
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desigβ¦β915Updated 3 months ago