OpenMOSS/LongLLaDA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMOSS/LongLLaDA)

OpenMOSS / LongLLaDA

[AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

☆55

Alternatives and similar repositories for LongLLaDA

Users that are interested in LongLLaDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Relaxed-System-Lab / UltraLLaDA
View on GitHub
We introduce UltraLLaDA , a scaled variant of LLaDA-8B-Base that extends the context length up to 128K tokens with light-weight post-trai…
☆15Oct 23, 2025Updated 8 months ago
OpenMOSS / Sparse-dLLM
View on GitHub
☆29Oct 16, 2025Updated 9 months ago
OpenMOSS / rope_pp
View on GitHub
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33Dec 9, 2025Updated 7 months ago
ML-GSAI / LLaDA-1.5
View on GitHub
☆55Apr 14, 2026Updated 3 months ago
czg1225 / dParallel
View on GitHub
[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆65Apr 12, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
dllm-reasoning / d1
View on GitHub
Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"
☆454Jan 26, 2026Updated 5 months ago
DreamLM / DreamOn
View on GitHub
Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas
☆118Feb 3, 2026Updated 5 months ago
INV-WZQ / SparseD
View on GitHub
[ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models
☆67Feb 22, 2026Updated 4 months ago
autonomousvision / mdpo
View on GitHub
MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models
☆45Jan 28, 2026Updated 5 months ago
Li-Jinsong / DAEDAL
View on GitHub
[ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"
☆173Feb 16, 2026Updated 5 months ago
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
tengxiaoliu / LM_skip
View on GitHub
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
☆21Jan 25, 2025Updated last year
NVlabs / Fast-dLLM
View on GitHub
Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"
☆1,063May 30, 2026Updated last month
xinghaow99 / pbs-attn
View on GitHub
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31May 22, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ML-GSAI / Diffusion-LLM-Papers
View on GitHub
A Collection of Papers on Diffusion Language Models
☆180Sep 15, 2025Updated 10 months ago
xiaohangt / wd1
View on GitHub
Official Implementation of wd1
☆32Sep 25, 2025Updated 9 months ago
yxzwang / PhenomNN
View on GitHub
Codes for Paper: From Hypergraph Energy Functions to Hypergraph Neural Networks
☆23Jun 29, 2023Updated 3 years ago
OpenMOSS / Thus-Spake-Long-Context-LLM
View on GitHub
a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation
☆62Mar 31, 2025Updated last year
DreamLM / Dream-Coder
View on GitHub
☆106Nov 17, 2025Updated 8 months ago
OpenMOSS / DiRL
View on GitHub
☆165Mar 30, 2026Updated 3 months ago
INV-WZQ / LightCL
View on GitHub
[ASP-DAC 2025] LightCL: Compact Continual Learning with Low Memory Footprint For Edge Device
☆16Apr 2, 2025Updated last year
maomaocun / dLLM-cache
View on GitHub
Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…
☆211May 1, 2026Updated 2 months ago
ML-GSAI / LLaDA-o
View on GitHub
☆53May 16, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
cychomatica / FreeDave
View on GitHub
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
☆23May 19, 2026Updated 2 months ago
aim-uofa / dLLM-MidTruth
View on GitHub
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66Mar 5, 2026Updated 4 months ago
ML-GSAI / LLaDA-V
View on GitHub
☆347Mar 23, 2026Updated 3 months ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
QuenithAI / Diffusion-Large-Language-Models-Paper-List
View on GitHub
Tracking the latest and greatest research papers on diffusion large language models.
☆32Mar 13, 2026Updated 4 months ago
Auraithm / LLADA_pretraining
View on GitHub
☆31Aug 18, 2025Updated 11 months ago
LiangrunFlora / Slow-Fast-Sampling
View on GitHub
Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…
☆43Jul 18, 2025Updated last year
pengzhangzhi / Open-dLLM
View on GitHub
Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
☆645May 31, 2026Updated last month
OpenLMLab / LongWanjuan
View on GitHub
Towards Systematic Measurement for Long Text Quality
☆39Sep 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / SPG
View on GitHub
Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"
☆62Oct 29, 2025Updated 8 months ago
INV-WZQ / SAMCL
View on GitHub
[AAAI 2026 Oral] SAMCL: Empowering SAM to Continually Learn from Dynamic Domains with Extreme Storage Efficiency
☆21Mar 25, 2026Updated 3 months ago
PPPP-kaqiu / Awesome-Parallel-Reasoning
View on GitHub
Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.
☆54Mar 8, 2026Updated 4 months ago
OpenLMLab / scaling-rope
View on GitHub
code for Scaling Laws of RoPE-based Extrapolation
☆73Oct 16, 2023Updated 2 years ago
ZhanqiuHu / flash-dlm-experimental
View on GitHub
Implementation of Flash-DLM (paper: FlashDLM: Accelerating Diffusion Language Models via Efficient KV Caching and Guided Diffusion). Prov…
☆24Nov 25, 2025Updated 7 months ago
hrlics / HoPE
View on GitHub
[NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
☆29Feb 19, 2026Updated 5 months ago
OpenMOSS / Embodied-Planner-R1
View on GitHub
Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
☆27Mar 30, 2026Updated 3 months ago