inclusionAI / dFactory
Easy and Efficient dLLM Fine-Tuning
☆181 · Updated last week
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models ☆372 · Updated last week
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆148 · Updated 6 months ago
- ☆108 · Updated 3 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache… ☆189 · Updated last month
- ☆106 · Updated this week
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM. ☆102 · Updated last year
- The official github repo for "Diffusion Language Models are Super Data Learners". ☆215 · Updated last month
- ☆114 · Updated 3 months ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints. ☆490 · Updated last month
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? ☆119 · Updated last year
- ☆62 · Updated 5 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models ☆126 · Updated 7 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference ☆216 · Updated 3 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization ☆105 · Updated 6 months ago
- ☆126 · Updated 6 months ago
- ☆85 · Updated last month
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group. ☆198 · Updated last week
- ☆79 · Updated last month
- ☆89 · Updated 6 months ago
- Geometric-Mean Policy Optimization ☆95 · Updated last month
- dParallel: Learnable Parallel Decoding for dLLMs ☆50 · Updated 2 months ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models ☆352 · Updated 6 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 · Updated 11 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ☆187 · Updated 5 months ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr… ☆306 · Updated last month
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas ☆95 · Updated 3 months ago
- AnchorAttention: Improved attention for LLMs long-context training ☆213 · Updated 11 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆57 · Updated 9 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca… ☆46 · Updated last month
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆170 · Updated 6 months ago