hao-ai-lab / d3LLMLinks
d3LLM: Ultra-Fast Diffusion LLM π
β49Updated 3 weeks ago
Alternatives and similar repositories for d3LLM
Users that are interested in d3LLM are comparing it to the libraries listed below
Sorting:
- β126Updated this week
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable SparseβLinear Attentionβ221Updated last week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Modelsβ127Updated 7 months ago
- β213Updated last month
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inferenceβ224Updated 3 months ago
- Easy and Efficient dLLM Fine-Tuningβ190Updated 3 weeks ago
- A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and cachβ¦β52Updated 2 months ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoringβ263Updated 6 months ago
- βοΈ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraintsβ78Updated 5 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cacheβ¦β191Updated last month
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Modelsβ380Updated 3 weeks ago
- β62Updated 6 months ago
- The most open diffusion language model for code generation β releasing pretraining, evaluation, inference, and checkpoints.β496Updated last month
- A sparse attention kernel supporting mix sparse patternsβ423Updated 3 weeks ago
- A lightweight Inference Engine built for block diffusion modelsβ38Updated 3 weeks ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafterβ120Updated last month
- Efficient triton implementation of Native Sparse Attention.β257Updated 7 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.β148Updated 6 months ago
- β31Updated 3 months ago
- A Collection of Papers on Diffusion Language Modelsβ149Updated 3 months ago
- β82Updated last month
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsityβ64Updated 6 months ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".β50Updated last week
- β188Updated 11 months ago
- β90Updated 6 months ago
- Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inferenceβ160Updated 2 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compressionβ50Updated 2 months ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tokβ¦β71Updated this week
- β114Updated 3 months ago
- Code for Draft Attentionβ98Updated 7 months ago