meta-llama / synthetic-data-kitLinks
Tool for generating high quality Synthetic datasets
☆1,130Updated 2 weeks ago
Alternatives and similar repositories for synthetic-data-kit
Users that are interested in synthetic-data-kit are comparing it to the libraries listed below
Sorting:
- An open-source tool for general prompt optimization.☆602Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction☆1,483Updated 3 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆385Updated 2 weeks ago
- Build datasets using natural language☆515Updated 3 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆870Updated last week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,540Updated 7 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆1,006Updated this week
- ☆678Updated 3 months ago
- Fast State-of-the-Art Static Embeddings☆1,801Updated last week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,571Updated last week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,361Updated 2 months ago
- Generate large synthetic data using an LLM☆441Updated this week
- Implementing the 4 agentic patterns from scratch☆1,507Updated 5 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,245Updated 3 months ago
- Open source project for data preparation for GenAI applications☆765Updated this week
- Verifiers for LLM Reinforcement Learning☆1,780Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆813Updated 6 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆926Updated 6 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,245Updated this week
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆828Updated this week
- 📝 Automatically annotate papers using LLMs☆339Updated 4 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,464Updated 3 months ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions☆1,109Updated last month
- ☆1,953Updated this week
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers…☆915Updated 2 months ago
- A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.☆647Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,713Updated this week
- ☆973Updated this week
- Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.☆731Updated last week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆839Updated 2 months ago