Easy and Efficient dLLM Fine-Tuning
☆261Mar 2, 2026Updated 3 months ago
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below.
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 11 months ago
- ☆28Aug 19, 2025Updated 10 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 9 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated last year
- ☆55Apr 14, 2026Updated 2 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆63Apr 12, 2026Updated 2 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆446Jan 26, 2026Updated 5 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆54Nov 4, 2025Updated 7 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆56May 5, 2026Updated last month
- Frequently updated list of dLLM (Diffusion Large Language Models) papers, models, and other resources☆48Jun 17, 2026Updated 2 weeks ago
- Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding☆220Dec 17, 2025Updated 6 months ago
- Dream 7B, a large diffusion language model☆1,249Nov 21, 2025Updated 7 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆77Feb 7, 2026Updated 4 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆38Apr 25, 2026Updated 2 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆510Jan 28, 2026Updated 5 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆1,050May 30, 2026Updated last month
- ☆37Oct 9, 2025Updated 8 months ago
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆45Aug 10, 2025Updated 10 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆58Mar 12, 2026Updated 3 months ago
- ☆40May 9, 2026Updated last month
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆66Mar 5, 2026Updated 3 months ago
- Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆636May 31, 2026Updated last month
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆31May 19, 2025Updated last year
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆118Apr 13, 2026Updated 2 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆118Feb 3, 2026Updated 4 months ago
- Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆88May 4, 2026Updated last month
- Paper list, tutorial, and nano code snippets for Diffusion Large Language Models.☆169Jan 19, 2026Updated 5 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆29May 3, 2025Updated last year
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 8 months ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆17Dec 30, 2024Updated last year
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆825Jul 9, 2025Updated 11 months ago
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- Official Implementation of wd1☆31Sep 25, 2025Updated 9 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,842Updated this week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆134May 22, 2025Updated last year
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)☆1,656Feb 14, 2026Updated 4 months ago
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at…☆103Jun 14, 2024Updated 2 years ago
- [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆395May 31, 2025Updated last year
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆26Dec 20, 2024Updated last year