[ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs
☆61Apr 12, 2026Updated last month
Alternatives and similar repositories for dParallel
Users that are interested in dParallel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 11 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆117Jul 9, 2025Updated 10 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆65Feb 22, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆33Jan 27, 2026Updated 3 months ago
- [Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation☆21Oct 1, 2025Updated 7 months ago
- A Collection of Papers on Diffusion Large Language Models☆47May 12, 2026Updated last week
- [ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.☆501Jan 28, 2026Updated 3 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Sequential Diffusion Language Model (SDLM) enhances pre-trained autoregressive language models by adaptively determining generation lengt…☆97Dec 27, 2025Updated 4 months ago
- ☆42Sep 5, 2023Updated 2 years ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆45Nov 25, 2025Updated 5 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆167Jan 19, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Mar 21, 2023Updated 3 years ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆31Mar 28, 2024Updated 2 years ago
- SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)☆353Mar 16, 2026Updated 2 months ago
- Official implementation of Categorical Flow Maps on text.☆57Feb 16, 2026Updated 3 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆57Sep 19, 2024Updated last year
- A fake presidential speech generator with a Mad Libs element.☆10Jul 19, 2017Updated 8 years ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆69Apr 4, 2026Updated last month
- ☆55Apr 14, 2026Updated last month
- Easy and Efficient dLLM Fine-Tuning☆251Mar 2, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 为 RWKV 设计的「Deep Think」实现。☆27Dec 7, 2025Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- This project involved the analysis of the ArXiv citation network.☆15Jan 29, 2022Updated 4 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆53Dec 22, 2025Updated 4 months ago
- Continuous Pipelined Speculative Decoding☆20Updated this week
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"☆78Feb 13, 2025Updated last year
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆67Jan 26, 2026Updated 3 months ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆32May 23, 2022Updated 3 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆14Nov 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆37Sep 20, 2025Updated 8 months ago
- ☆49May 14, 2026Updated last week
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆26Mar 10, 2026Updated 2 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆258Sep 26, 2025Updated 7 months ago
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆107Feb 26, 2024Updated 2 years ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? [ICLR26]☆39Jun 23, 2025Updated 10 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆44Nov 8, 2024Updated last year