Easy and Efficient dLLM Fine-Tuning
☆239Mar 2, 2026Updated last month
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The officalimplement of dLLM-Factory☆25Jul 12, 2025Updated 8 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 6 months ago
- dInfer: An Efficient Inference Framework for Diffusion Language Models☆449Feb 11, 2026Updated 2 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 10 months ago
- ☆55Jun 4, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆61Feb 22, 2026Updated last month
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆430Jan 26, 2026Updated 2 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 5 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆51Feb 12, 2026Updated last month
- Dream 7B, a large diffusion language model☆1,211Nov 21, 2025Updated 4 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆36Updated this week
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆72Feb 7, 2026Updated 2 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆486Jan 28, 2026Updated 2 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆915Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ☆37Oct 9, 2025Updated 6 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆164Feb 16, 2026Updated last month
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 4 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 3 weeks ago
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆346Mar 16, 2026Updated 3 weeks ago
- ☆39Aug 28, 2025Updated 7 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆201Dec 17, 2025Updated 3 months ago
- Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆565Mar 1, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆112Feb 4, 2026Updated 2 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆64Mar 5, 2026Updated last month
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31May 19, 2025Updated 10 months ago
- ☆120Mar 18, 2026Updated 3 weeks ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆109Feb 3, 2026Updated 2 months ago
- Official implement of paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆75Mar 16, 2026Updated 3 weeks ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆162Jan 19, 2026Updated 2 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆47Oct 20, 2025Updated 5 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆97Apr 3, 2026Updated last week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆811Jul 9, 2025Updated 9 months ago
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- Official Implementation of wd1☆25Sep 25, 2025Updated 6 months ago
- ☆17Apr 9, 2025Updated last year
- A collection of papers on discrete diffusion models☆166Mar 9, 2026Updated last month
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 10 months ago