NVlabs / Jet-NemotronLinks
☆702Updated last month
Alternatives and similar repositories for Jet-Nemotron
Users that are interested in Jet-Nemotron are comparing it to the libraries listed below
Sorting:
- ☆1,161Updated 2 weeks ago
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …☆570Updated 2 weeks ago
- DFloat11: Lossless LLM Compression for Efficient GPU Inference☆559Updated 2 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆712Updated last month
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆820Updated this week
- QeRL enables RL for 32B LLMs on a single H100 GPU.☆432Updated last month
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning☆244Updated 2 weeks ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated 2 weeks ago
- Sparse Inferencing for transformer based LLMs☆201Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆299Updated 2 weeks ago
- ☆477Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆913Updated 5 months ago
- dLLM: Simple Diffusion Language Modeling☆529Updated last week
- Official implementation of "Continuous Autoregressive Language Models"☆470Updated last week
- An interface library for RL post training with environments.☆687Updated this week
- Utils for Unsloth https://github.com/unslothai/unsloth☆171Updated last week
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆504Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆348Updated 4 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆129Updated 3 months ago
- A command-line interface tool for serving LLM using vLLM.☆441Updated 3 weeks ago
- VPTQ, A Flexible and Extreme low-bit quantization algorithm☆666Updated 6 months ago
- Advanced quantization toolkit for LLMs. Native support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Bits and seamless integration with Transform…☆712Updated this week
- ☆894Updated 2 weeks ago
- ☆200Updated 11 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,162Updated 9 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆327Updated last year
- Dream 7B, a large diffusion language model☆1,081Updated last month
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆805Updated 4 months ago
- ☆300Updated 3 months ago
- AlphaGo Moment for Model Architecture Discovery.☆1,114Updated 3 months ago