NVIDIA-NeMo / DataDesignerLinks
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
☆620Updated this week
Alternatives and similar repositories for DataDesigner
Users that are interested in DataDesigner are comparing it to the libraries listed below
Sorting:
- An open-source tool for LLM prompt optimization.☆742Updated last week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆495Updated 4 months ago
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.☆261Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- An interface library for RL post training with environments.☆1,004Updated this week
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆855Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆423Updated 2 weeks ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference exampl…☆349Updated this week
- ☆257Updated this week
- Build datasets using natural language☆558Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 4 months ago
- On the Theoretical Limitations of Embedding-Based Retrieval☆617Updated 4 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 4 months ago
- A Lightweight Library for AI Observability☆253Updated 10 months ago
- ☆124Updated 3 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 7 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆804Updated last week
- ☆218Updated 6 months ago
- RAG evaluation without the need for "golden answers"☆333Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆256Updated this week
- ☆182Updated 10 months ago
- Tool for generating high quality Synthetic datasets☆1,463Updated 2 months ago
- ☆212Updated 7 months ago
- From data to vector database effortlessly☆89Updated 7 months ago
- Provider-agnostic, open-source evaluation infrastructure for language models☆705Updated 3 weeks ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆303Updated this week
- Verifiers for LLM Reinforcement Learning☆80Updated 4 months ago
- Together Open Deep Research☆356Updated 8 months ago
- A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework☆199Updated 3 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆269Updated this week