NVIDIA-NeMo / Nemotron
Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference examples to build with Nemotron models
☆173 · Updated this week
Alternatives and similar repositories for Nemotron
Users who are interested in Nemotron are comparing it to the libraries listed below.
- Simple & Scalable Pretraining for Neural Architecture Research ☆305 · Updated 2 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆241 · Updated last week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆301 · Updated this week
- 👷 Build compute kernels ☆193 · Updated this week
- Train, tune, and infer Bamba model ☆137 · Updated 6 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆354 · Updated 5 months ago
- PyTorch-native post-training at scale ☆566 · Updated this week
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support ☆209 · Updated this week
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning ☆275 · Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆136 · Updated 4 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆481 · Updated 3 months ago
- Open-source release accompanying Gao et al. 2025 ☆218 · Updated last week
- All information and news with respect to Falcon-H1 series ☆93 · Updated 2 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆360 · Updated last year
- Training API and CLI ☆266 · Updated this week
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆280 · Updated 2 months ago
- RLP: Reinforcement as a Pretraining Objective ☆213 · Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆277 · Updated 3 weeks ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆272 · Updated this week
- QeRL enables RL for 32B LLMs on a single H100 GPU. ☆467 · Updated 3 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆121 · Updated 2 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆321 · Updated last month
- Benchmark and optimize LLM inference across frameworks with ease ☆150 · Updated 3 months ago
- Data recipes and robust infrastructure for training AI agents ☆61 · Updated this week
- ☆219 · Updated 10 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆167 · Updated 3 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆349 · Updated 7 months ago
- GRadient-INformed MoE ☆265 · Updated last year
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆864 · Updated last week
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆379 · Updated last week