inclusionAI / dFactoryLinks
Easy and Efficient dLLM Fine-Tuning
☆131Updated last week
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆186Updated 2 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆136Updated 5 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆104Updated 6 months ago
- ☆104Updated 2 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆120Updated 6 months ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆339Updated 2 weeks ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?☆118Updated last year
- ☆90Updated last week
- Geometric-Mean Policy Optimization☆95Updated 2 weeks ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆116Updated last week
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆205Updated 2 months ago
- ☆61Updated 4 months ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆469Updated 3 weeks ago
- ☆105Updated 5 months ago
- ☆110Updated 2 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆328Updated 6 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆168Updated 6 months ago
- dParallel: Learnable Parallel Decoding for dLLMs☆42Updated last month
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 9 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆160Updated 2 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆42Updated last month
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆64Updated 2 months ago
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.☆99Updated 11 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆80Updated 2 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆207Updated last month
- A Collection of Papers on Diffusion Language Models☆147Updated 2 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆41Updated 5 months ago
- ☆85Updated 3 weeks ago
- ☆54Updated 6 months ago