zhijie-group / LoPALinks

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

☆22

Alternatives and similar repositories for LoPA

Users that are interested in LoPA are comparing it to the libraries listed below

Sorting:

pixeli99 / Prophet
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
☆42Updated 4 months ago
Karine-Huang / GenMAC
☆30Updated last year
czg1225 / VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆63Updated 3 months ago
FrankYang-17 / Mavors
☆15Updated 7 months ago
EnVision-Research / TiViBench
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models
☆64Updated last month
showlab / UniRL
The code repository of UniRL
☆47Updated 7 months ago
InternLM / ARM-Thinker
Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"
☆72Updated last month
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆36Updated 6 months ago
thuml / MiniVeo3-Reasoner
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…
☆199Updated 2 months ago
TencentARC / Video-Holmes
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆85Updated 5 months ago
HongbangYuan / OmniReward
☆35Updated 3 weeks ago
KlingTeam / PhysMaster
Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning
☆56Updated 2 months ago
Tiezheng11 / Vision-Language-Vision
☆63Updated 5 months ago
ByteDance-Seed / Seed-1.8
☆180Updated 2 weeks ago
zhijie-group / UniCMs
☆39Updated 7 months ago
mlvlab / DeepVideoR1
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆31Updated last month
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆111Updated 2 months ago
INV-WZQ / SparseD
[Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models
☆53Updated 3 months ago
TencentARC / GRPO-CARE
☆80Updated 6 months ago
TencentARC / DSR_Suite
☆48Updated last week
qishisuren123 / AnyCap
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆52Updated 5 months ago
tongjingqi / Thinking-with-Video
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆228Updated last week
aim-uofa / dLLM-MidTruth
☆57Updated 4 months ago
ThinkMorph / ThinkMorph
The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆136Updated 2 weeks ago
ali-vilab / TTS-VAR
Test-time Scaling for VAR models
☆28Updated 3 months ago
AntResearchNLP / ViLaSR
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
☆85Updated 5 months ago
M-E-AGI-Lab / Muddit
Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.
☆96Updated this week
OpenIXCLab / CODA
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆32Updated 4 months ago
Yu-xm / Unicorn
Text-Only Data Synthesis for Vision Language Model Training
☆22Updated 6 months ago
path2generalist / General-Level
On Path to Multimodal Generalist: General-Level and General-Bench
☆19Updated 5 months ago