NVIDIA-NeMo / NemotronLinks

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference examples to build with Nemotron models

☆314

Alternatives and similar repositories for Nemotron

Users that are interested in Nemotron are comparing it to the libraries listed below

Sorting:

facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆250Updated this week
NVlabs / ToolOrchestra
ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.
☆450Updated 2 weeks ago
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆169Updated 4 months ago
google-deepmind / regress-lm
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…
☆305Updated 3 weeks ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆306Updated last month
huggingface / gpt-oss-recipes
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
☆494Updated 4 months ago
meta-pytorch / OpenEnv
An interface library for RL post training with environments.
☆973Updated this week
LLMSELECTOR / LLMSELECTOR
☆79Updated 3 months ago
facebookresearch / cwm
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
☆792Updated 2 weeks ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆276Updated this week
meta-pytorch / torchforge
PyTorch-native post-training at scale
☆584Updated this week
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆276Updated 5 months ago
alexzhang13 / rlm-minimal
Super basic implementation (gist-like) of RLMs with REPL environments.
☆390Updated this week
deepreinforce-ai / CUDA-L1
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning
☆277Updated 2 months ago
bentoml / llm-optimizer
Benchmark and optimize LLM inference across frameworks with ease
☆153Updated 3 months ago
NVIDIA-NeMo / DataDesigner
🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.
☆603Updated last week
NVIDIA-NeMo / Automodel
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
☆232Updated this week
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆355Updated 6 months ago
zhengkid / Parallel-R1
The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆250Updated last month
open-tinker / OpenTinker
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆499Updated last week
SakanaAI / treequest
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
☆512Updated last month
allenai / bolmo-core
Code for Bolmo: Byteifying the Next Generation of Language Models
☆112Updated 2 weeks ago
open-thoughts / OpenThoughts-Agent
Data recipes and robust infrastructure for training AI agents
☆75Updated this week
openai / circuit_sparsity
Open-source release accompanying Gao et al. 2025
☆486Updated 3 weeks ago
huggingface / kernel-builder
👷 Build compute kernels
☆198Updated 2 weeks ago
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆137Updated 4 months ago
huggingface / smol2operator
☆126Updated 3 months ago
pyember / ember
☆233Updated this week
SakanaAI / ab-mcts-arc2
☆106Updated 6 months ago
google / lmeval
☆236Updated last month