czg1225 / dParallelLinks
dParallel: Learnable Parallel Decoding for dLLMs
☆42Updated last month
Alternatives and similar repositories for dParallel
Users that are interested in dParallel are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆62Updated 2 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆121Updated 6 months ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆86Updated 2 months ago
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated 3 months ago
- Code for Heima☆58Updated 7 months ago
- LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)☆25Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆87Updated 9 months ago
- Data distillation benchmark☆71Updated 5 months ago
- ☆104Updated 2 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆112Updated 4 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆35Updated 4 months ago
- ☆61Updated 4 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆39Updated 2 months ago
- ☆30Updated 2 weeks ago
- Recent Advances on MLLM's Reasoning Ability☆26Updated 7 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Updated 7 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆39Updated last year
- Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"☆71Updated last month
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆49Updated last year
- Codes for Merging Large Language Models☆33Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated 2 months ago
- ☆22Updated 6 months ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆60Updated last week
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆27Updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Updated 6 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆86Updated 5 months ago
- ☆31Updated 3 weeks ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆42Updated last month
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- ☆10Updated last year