NVlabs / NFTLinks
Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"
☆67Updated 4 months ago
Alternatives and similar repositories for NFT
Users that are interested in NFT are comparing it to the libraries listed below
Sorting:
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆380Updated 3 weeks ago
- Easy and Efficient dLLM Fine-Tuning☆190Updated 3 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆51Updated 5 months ago
- ☆64Updated 2 months ago
- ☆128Updated last month
- ☆55Updated 7 months ago
- ☆109Updated 3 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆50Updated last month
- ☆302Updated 3 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Updated 2 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆63Updated 10 months ago
- ☆35Updated 9 months ago
- ☆114Updated 3 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆118Updated 2 months ago
- A Collection of Papers on Diffusion Language Models☆149Updated 3 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 10 months ago
- ☆365Updated 2 months ago
- ☆126Updated this week
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆68Updated 3 weeks ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆97Updated 3 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆94Updated 7 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆215Updated 2 months ago
- Multimodal RewardBench☆58Updated 10 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 11 months ago
- Official Repository of LatentSeek☆73Updated 7 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆357Updated 7 months ago
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆40Updated 2 months ago
- MiroTrain is an efficient and algorithm-first framework for post-training large agentic models.☆100Updated 4 months ago
- ☆82Updated last month
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆40Updated 6 months ago