NVlabs / NFTLinks
Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"
☆65Updated 3 months ago
Alternatives and similar repositories for NFT
Users that are interested in NFT are comparing it to the libraries listed below
Sorting:
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆347Updated last week
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆113Updated last month
- ☆62Updated last month
- ☆105Updated 3 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆48Updated 4 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆208Updated last month
- P1: Mastering Physics Olympiads with Reinforcement Learning☆67Updated 3 weeks ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 9 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆123Updated 8 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆59Updated 9 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆57Updated last year
- ☆112Updated 2 months ago
- Official Repository of LatentSeek☆70Updated 6 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆90Updated 6 months ago
- Easy and Efficient dLLM Fine-Tuning☆139Updated last week
- ☆293Updated 2 months ago
- Geometric-Mean Policy Optimization☆95Updated 3 weeks ago
- Multimodal RewardBench☆55Updated 9 months ago
- ☆121Updated 2 weeks ago
- LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆46Updated last week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆123Updated 6 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆347Updated 6 months ago
- A repo for open research on building large reasoning models☆117Updated 3 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆139Updated 5 months ago
- MiroTrain is an efficient and algorithm-first framework for post-training large agentic models.☆99Updated 3 months ago
- A Collection of Papers on Diffusion Language Models☆148Updated 2 months ago
- ☆55Updated 6 months ago
- ☆70Updated 5 months ago
- ☆62Updated 5 months ago