ML-GSAI / LLaDA-1.5View external linksLinks
☆55Jun 4, 2025Updated 8 months ago
Alternatives and similar repositories for LLaDA-1.5
Users that are interested in LLaDA-1.5 are comparing it to the libraries listed below
Sorting:
- ☆37Aug 28, 2025Updated 5 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 8 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆56Nov 5, 2025Updated 3 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆402Jan 26, 2026Updated 2 weeks ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Jul 18, 2025Updated 6 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆50Jan 23, 2026Updated 3 weeks ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆59Feb 6, 2026Updated last week
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated 3 weeks ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Dec 7, 2025Updated 2 months ago
- code for promptCSE, emnlp 2022☆11Apr 10, 2023Updated 2 years ago
- [NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"☆22Jan 3, 2026Updated last month
- ☆42Sep 15, 2025Updated 4 months ago
- Easy and Efficient dLLM Fine-Tuning☆209Jan 21, 2026Updated 3 weeks ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- ☆319Dec 16, 2025Updated last month
- [EMNLP2022] Released code for paper "Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition"☆23Feb 9, 2023Updated 3 years ago
- Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"☆36Jan 11, 2026Updated last month
- MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models☆40Jan 28, 2026Updated 2 weeks ago
- ☆32Oct 13, 2025Updated 4 months ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆833Jan 28, 2026Updated 2 weeks ago
- ☆21Dec 6, 2025Updated 2 months ago
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!☆33Jun 23, 2025Updated 7 months ago
- GUI for LLaDA Diffusion LLM with Quantization for low end GPU and CPU options.☆25Mar 7, 2025Updated 11 months ago
- Implementation of "Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models" [NeurIPS 2025]☆73Dec 17, 2025Updated last month
- [ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆423Jan 28, 2026Updated 2 weeks ago
- Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"☆49Oct 29, 2025Updated 3 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆162Sep 12, 2025Updated 5 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆57Oct 7, 2025Updated 4 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆45Sep 8, 2025Updated 5 months ago
- Code for the paper https://arxiv.org/abs/2205.14987v2☆58Apr 18, 2024Updated last year
- ☆31Jun 12, 2024Updated last year
- SFT+RL boosts multimodal reasoning☆45Jun 27, 2025Updated 7 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- Code for Stable Control Representations☆26Apr 5, 2025Updated 10 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data" (ICLR…☆79May 30, 2025Updated 8 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Nov 10, 2025Updated 3 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆58Jan 26, 2026Updated 2 weeks ago