[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆59Feb 22, 2026Updated last week
Alternatives and similar repositories for dParallel
Users that are interested in dParallel are comparing it to the libraries listed below
Sorting:
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆30Jan 27, 2026Updated last month
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆58Feb 22, 2026Updated last week
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated 11 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 7 months ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 5 months ago
- ☆35Jun 14, 2025Updated 8 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Nov 15, 2024Updated last year
- A Collection of Papers on Diffusion Large Language Models☆39Feb 20, 2026Updated last week
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆30Mar 28, 2024Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Mar 21, 2023Updated 2 years ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- ☆42Sep 5, 2023Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆63Jan 26, 2026Updated last month
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆435Jan 28, 2026Updated last month
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆333Dec 15, 2025Updated 2 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆116Dec 30, 2025Updated 2 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆66Jan 13, 2026Updated last month
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated 2 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆155Jan 19, 2026Updated last month
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆56Sep 19, 2024Updated last year
- ☆47Oct 2, 2025Updated 4 months ago
- An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool us…☆284Feb 3, 2026Updated 3 weeks ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆50Nov 4, 2025Updated 3 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆253Sep 26, 2025Updated 5 months ago
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆90Dec 27, 2025Updated 2 months ago
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"☆74Feb 13, 2025Updated last year
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Dec 7, 2025Updated 2 months ago
- Twinkle✨: Training workbench to make your model glow.☆45Updated this week
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Sep 9, 2025Updated 5 months ago
- ☆55Jun 4, 2025Updated 8 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆81Jul 23, 2024Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆160Dec 1, 2025Updated 3 months ago