NVlabs / Jet-NemotronLinks
☆695Updated 2 weeks ago
Alternatives and similar repositories for Jet-Nemotron
Users that are interested in Jet-Nemotron are comparing it to the libraries listed below
Sorting:
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …☆541Updated last week
- DFloat11: Lossless LLM Compression for Efficient GPU Inference☆554Updated 2 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆775Updated last week
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆656Updated 3 weeks ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆682Updated last month
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning☆195Updated this week
- Sparse Inferencing for transformer based LLMs☆201Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆297Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆900Updated 4 months ago
- ☆445Updated this week
- Self-Adapting Language Models☆1,400Updated 2 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆127Updated 2 months ago
- An interface library for RL post training with environments.☆66Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆479Updated last week
- VPTQ, A Flexible and Extreme low-bit quantization algorithm☆659Updated 6 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆800Updated 3 months ago
- Post-training with Tinker☆1,096Updated last week
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆225Updated last week
- All information and news with respect to Falcon-H1 series☆91Updated 2 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆346Updated 4 months ago
- ☆828Updated last month
- Utils for Unsloth https://github.com/unslothai/unsloth☆162Updated this week
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆484Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated 2 months ago
- GRadient-INformed MoE☆264Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models☆273Updated 3 months ago
- ☆201Updated 10 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆463Updated 2 months ago
- Efficient LLM Inference over Long Sequences☆390Updated 4 months ago
- ☆300Updated 2 months ago