🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
☆866Mar 20, 2026Updated this week
Alternatives and similar repositories for DataDesigner
Users that are interested in DataDesigner are comparing it to the libraries listed below.
- Torque is a declarative, type-safe DSL for building synthetic LLM datasets: compose conversations like React components☆89Nov 19, 2025Updated 4 months ago
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated 2 months ago
- Open-source library for scalable, reproducible evaluation of AI models and benchmarks.☆240Updated this week
- Agentkube - Run Kubernetes Like Never Before☆37Mar 1, 2026Updated 3 weeks ago
- ☆38Jan 19, 2026Updated 2 months ago
- ☆15Jun 27, 2023Updated 2 years ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 5 months ago
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆71Dec 22, 2025Updated 3 months ago
- Storing long contexts in tiny caches with self-study☆249Dec 5, 2025Updated 3 months ago
- ☆27Feb 11, 2026Updated last month
- CLI for Recursive Language Models☆63Jan 28, 2026Updated last month
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆348Sep 12, 2025Updated 6 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Build RL environments for LLM training☆758Updated this week
- ☆55Mar 3, 2026Updated 2 weeks ago
- Alpaca-LoRA as Chatbot service☆13Mar 30, 2023Updated 2 years ago
- ChromaDB Data Pipes 🖇️ - The easiest way to get data into and out of ChromaDB☆20Oct 22, 2024Updated last year
- DocumentDB is the open-source engine powering vCore-based Azure Cosmos DB for MongoDB. It offers a native implementation of document-orie…☆22Feb 8, 2026Updated last month
- ☆22Mar 6, 2024Updated 2 years ago
- ☆11Aug 26, 2024Updated last year
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- Scalable toolkit for efficient model reinforcement☆1,447Updated this week
- This is a companion repository for the On Prem RAG AIM Event☆11Nov 30, 2024Updated last year
- Semantic search and document parsing tools for the command line☆1,754Mar 11, 2026Updated last week
- TaskWeaver Plugins☆12Jan 28, 2024Updated 2 years ago
- vLLM adapter for a TGIS-compatible gRPC server.☆55Updated this week
- ☆99Feb 27, 2026Updated 3 weeks ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆56Jul 28, 2025Updated 7 months ago
- ☆58Feb 27, 2025Updated last year
- ☆27Updated this week
- 🚀 Lightweight Python library for building production LLM applications with smart context management and automatic token optimization. Sa…☆36Dec 23, 2025Updated 3 months ago
- Tree Indexing for Long Conversations☆108Jan 8, 2026Updated 2 months ago
- Learn AI development without frameworks. How ChatGPT, RAG, and AI agents actually work through 10 progressive modules. Just Python, APIs…☆38Mar 8, 2026Updated 2 weeks ago
- NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extracti…☆2,874Updated this week
- Transparent Reporting of Ethics for Generative AI (TREGAI) Checklist☆15Oct 16, 2024Updated last year
- Healthcare AI Competition☆10Dec 16, 2023Updated 2 years ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆191Jan 12, 2026Updated 2 months ago
- ☆12Sep 25, 2024Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,961Updated this week