NVlabs / Jet-Nemotron
☆716 Updated 3 weeks ago
Alternatives and similar repositories for Jet-Nemotron
Users who are interested in Jet-Nemotron are comparing it to the libraries listed below.
- ☆1,245 Updated last month
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model … ☆585 Updated this week
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning ☆277 Updated last month
- DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference ☆576 Updated last month
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆871 Updated this week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆553 Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆778 Updated this week
- QeRL enables RL for 32B LLMs on a single H100 GPU. ☆470 Updated last month
- Open-source release accompanying Gao et al. 2025 ☆471 Updated 2 weeks ago
- dLLM: Simple Diffusion Language Modeling ☆1,504 Updated this week
- Sparse Inferencing for transformer-based LLMs ☆215 Updated 4 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models" ☆304 Updated this week
- Simple & Scalable Pretraining for Neural Architecture Research ☆305 Updated 3 weeks ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆430 Updated this week
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025) ☆526 Updated 3 months ago
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆710 Updated last week
- Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Tra… ☆775 Updated this week
- Official implementation of "Continuous Autoregressive Language Models" ☆677 Updated 3 weeks ago
- ☆617 Updated this week
- A command-line interface tool for serving LLMs using vLLM. ☆456 Updated 3 weeks ago
- A framework for efficient model inference with omni-modality models ☆1,335 Updated this week
- ☆426 Updated 3 weeks ago
- Using reinforcement learning to train teachers that can make LLMs learn how to reason for test-time scaling. ☆353 Updated 6 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input ☆933 Updated 6 months ago
- dInfer: An Efficient Inference Framework for Diffusion Language Models ☆373 Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆509 Updated 2 weeks ago
- An interface library for RL post-training with environments. ☆859 Updated this week
- ☆205 Updated last year
- AlphaGo Moment for Model Architecture Discovery. ☆1,125 Updated 3 weeks ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆136 Updated 4 months ago