NVlabs / NFTLinks
Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"
☆18Updated last week
Alternatives and similar repositories for NFT
Users that are interested in NFT are comparing it to the libraries listed below
Sorting:
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated last month
- ☆152Updated last week
- ☆30Updated 2 months ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆60Updated 3 weeks ago
- ✈️ Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆69Updated 2 months ago
- ☆37Updated last month
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆13Updated 7 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆71Updated 3 weeks ago
- ☆35Updated 2 weeks ago
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination☆47Updated this week
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 6 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆33Updated 4 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆64Updated last month
- ☆45Updated last week
- ☆37Updated last month
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆113Updated last week
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 6 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆37Updated 3 months ago
- The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"☆35Updated last week
- Code release for paper "Test-Time Training Done Right"☆149Updated last week
- ☆85Updated 2 months ago
- A Collection of Papers on Diffusion Language Models☆81Updated last week
- Autoregressive Image Generation with Randomized Parallel Decoding☆67Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆49Updated 3 months ago
- Multimodal RewardBench☆41Updated 4 months ago
- ☆44Updated 5 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆35Updated 5 months ago
- VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆51Updated 3 weeks ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆96Updated last week
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆45Updated last month