Easy and Efficient dLLM Fine-Tuning
☆235Mar 2, 2026Updated 2 weeks ago
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below
Sorting:
- The officalimplement of dLLM-Factory☆26Jul 12, 2025Updated 8 months ago
- ☆25Aug 19, 2025Updated 7 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆39Sep 30, 2025Updated 5 months ago
- dInfer: An Efficient Inference Framework for Diffusion Language Models☆434Feb 11, 2026Updated last month
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 9 months ago
- ☆55Jun 4, 2025Updated 9 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆62Feb 22, 2026Updated 3 weeks ago
- ☆15Sep 22, 2024Updated last year
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆419Jan 26, 2026Updated last month
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 4 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆48Feb 12, 2026Updated last month
- Dream 7B, a large diffusion language model☆1,198Nov 21, 2025Updated 4 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆70Feb 7, 2026Updated last month
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆36Jan 16, 2026Updated 2 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆464Jan 28, 2026Updated last month
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆883Jan 28, 2026Updated last month
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆43Aug 10, 2025Updated 7 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated last week
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ☆37Oct 9, 2025Updated 5 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆21Dec 2, 2025Updated 3 months ago
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆342Updated this week
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆199Dec 17, 2025Updated 3 months ago
- ☆35Feb 15, 2026Updated last month
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆106Feb 3, 2026Updated last month
- Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆554Mar 1, 2026Updated 2 weeks ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆106Feb 4, 2026Updated last month
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆63Mar 5, 2026Updated 2 weeks ago
- ☆121Jan 8, 2026Updated 2 months ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31May 19, 2025Updated 10 months ago
- Official implement of paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆75Updated this week
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆156Jan 19, 2026Updated 2 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆46Oct 20, 2025Updated 5 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆28May 3, 2025Updated 10 months ago
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆803Jul 9, 2025Updated 8 months ago
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- ☆17Apr 9, 2025Updated 11 months ago
- A collection of papers on discrete diffusion models☆167Mar 9, 2026Updated last week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 10 months ago