NVIDIA-NeMo / DataDesignerLinks
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
☆674Updated this week
Alternatives and similar repositories for DataDesigner
Users that are interested in DataDesigner are comparing it to the libraries listed below
Sorting:
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.☆646Updated last week
- An open-source tool for LLM prompt optimization.☆759Updated last week
- Provider-agnostic, open-source evaluation infrastructure for language models☆719Updated last month
- An interface library for RL post training with environments.☆1,112Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- On the Theoretical Limitations of Embedding-Based Retrieval☆622Updated 4 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆426Updated last month
- ☆278Updated last week
- ☆237Updated 2 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆874Updated this week
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆1,070Updated this week
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆592Updated last month
- Benchmark and optimize LLM inference across frameworks with ease☆161Updated 4 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 5 months ago
- a1facts - the precision layer for AI agents☆64Updated 4 months ago
- Tool for generating high quality Synthetic datasets☆1,484Updated 3 months ago
- ☆98Updated 3 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆140Updated 5 months ago
- From data to vector database effortlessly☆90Updated 8 months ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere…☆392Updated this week
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆275Updated last week
- Salesforce Enterprise Deep Research☆1,064Updated last week
- Making docling agentic through MCP☆395Updated 2 weeks ago
- Docling LangChain integration☆63Updated 2 months ago
- Ranking LLMs on agentic tasks☆210Updated 2 months ago
- Build datasets using natural language☆566Updated 4 months ago
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,244Updated last week