Easy and Efficient dLLM Fine-Tuning
☆251Mar 2, 2026Updated 2 months ago
Alternatives and similar repositories for dFactory
Users that are interested in dFactory are comparing it to the libraries listed below.
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 10 months ago
- ☆28Aug 19, 2025Updated 9 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 7 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 11 months ago
- ☆55Apr 14, 2026Updated last month
- dInfer: An Efficient Inference Framework for Diffusion Language Models☆460Feb 11, 2026Updated 3 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆61Apr 12, 2026Updated last month
- ☆16Sep 22, 2024Updated last year
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆443Jan 26, 2026Updated 3 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 6 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆54May 5, 2026Updated 2 weeks ago
- Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding☆211Dec 17, 2025Updated 5 months ago
- Dream 7B, a large diffusion language model☆1,237Nov 21, 2025Updated 6 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆75Feb 7, 2026Updated 3 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Apr 25, 2026Updated 3 weeks ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆501Jan 28, 2026Updated 3 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆973Updated this week
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆17Dec 19, 2024Updated last year
- Benchmarking agent reasoning capabilities in physical interactions, tool usage, and multi-agent coordination.☆45Aug 10, 2025Updated 9 months ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆166Feb 16, 2026Updated 3 months ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆23Dec 2, 2025Updated 5 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Mar 12, 2026Updated 2 months ago
- ICLR 2026☆42May 12, 2026Updated last week
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)☆353Mar 16, 2026Updated 2 months ago
- ☆40May 9, 2026Updated last week
- Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.☆93Oct 26, 2025Updated 6 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…☆113Apr 13, 2026Updated last month
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆64Mar 5, 2026Updated 2 months ago
- Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆602May 11, 2026Updated last week
- ☆123Mar 18, 2026Updated 2 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆114Feb 3, 2026Updated 3 months ago
- Paper list, tutorial, and nano code snippets for Diffusion Large Language Models.☆167Jan 19, 2026Updated 4 months ago
- Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026☆82May 4, 2026Updated 2 weeks ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆29May 3, 2025Updated last year
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 7 months ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆265Updated this week
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation☆820Jul 9, 2025Updated 10 months ago
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago