maomaocun / dLLM-VarLinks
The official implementation of dLLM-Var
☆31Updated 3 months ago
Alternatives and similar repositories for dLLM-Var
Users that are interested in dLLM-Var are comparing it to the libraries listed below
Sorting:
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆154Updated 3 weeks ago
- [ICLR 2026] TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆423Updated 2 weeks ago
- A Collection of Papers on Diffusion Language Models☆155Updated 4 months ago
- ☆145Updated 3 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Updated 3 months ago
- [ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆143Updated 2 weeks ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆45Updated 5 months ago
- ☆37Updated 5 months ago
- ☆318Updated last month
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆40Updated 6 months ago
- 📚 Collection of token-level model compression resources.☆190Updated 5 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆197Updated 2 months ago
- Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".☆115Updated 2 weeks ago
- Official Repository of LatentSeek☆76Updated 8 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆104Updated last year
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆58Updated 2 weeks ago
- Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"☆36Updated last month
- [TMLR 2025] Efficient Reasoning Models: A Survey☆298Updated last week
- d3LLM: Ultra-Fast Diffusion LLM 🚀☆90Updated last week
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129Updated 8 months ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆519Updated 3 months ago
- Code release for VTW (AAAI 2025 Oral)☆64Updated 3 months ago
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆103Updated 4 months ago
- A lightweight Inference Engine built for block diffusion models☆40Updated 2 months ago
- GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 tr…☆322Updated 3 months ago
- Paper List of Inference/Test Time Scaling/Computing☆344Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Updated 11 months ago
- ScalingOpt - Optimization Community☆78Updated last week
- ☆33Updated 8 months ago
- Easy and Efficient dLLM Fine-Tuning☆209Updated 3 weeks ago