OpenMOSS / DiRLLinks
☆118Updated last week
Alternatives and similar repositories for DiRL
Users that are interested in DiRL are comparing it to the libraries listed below
Sorting:
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆224Updated 3 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆191Updated last month
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆127Updated 7 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆148Updated 6 months ago
- Easy and Efficient dLLM Fine-Tuning☆190Updated 3 weeks ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆380Updated 3 weeks ago
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆49Updated 2 weeks ago
- ☆62Updated 6 months ago
- A Collection of Papers on Diffusion Language Models☆149Updated 3 months ago
- ☆31Updated 3 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆78Updated 5 months ago
- ☆114Updated 3 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Updated 2 months ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆105Updated 7 months ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆136Updated 2 weeks ago
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆64Updated 6 months ago
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆329Updated 7 months ago
- ☆109Updated 3 months ago
- ☆302Updated 3 weeks ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆496Updated last month
- ☆161Updated last month
- A lightweight Inference Engine built for block diffusion models☆38Updated 3 weeks ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆71Updated this week
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆63Updated 3 months ago
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.☆104Updated last year
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆81Updated last week
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆42Updated 4 months ago
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention☆221Updated last week
- Fast and memory-efficient exact kmeans☆131Updated last month
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆263Updated 6 months ago