SJTU-DENG-Lab / LoPALinks
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
☆33Updated 2 weeks ago
Alternatives and similar repositories for LoPA
Users that are interested in LoPA are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Updated 4 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆43Updated 4 months ago
- The code repository of UniRL☆51Updated 8 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆47Updated 6 months ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆79Updated last month
- ☆39Updated 8 months ago
- ☆63Updated 6 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆29Updated 3 weeks ago
- ☆204Updated last month
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆31Updated 3 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆206Updated 3 months ago
- ☆80Updated 7 months ago
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆117Updated 2 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆47Updated 2 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Updated last year
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆56Updated 3 months ago
- ☆62Updated 2 months ago
- More reliable Video Understanding Evaluation☆13Updated 4 months ago
- ☆40Updated 2 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆37Updated 7 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆20Updated 6 months ago
- Text-Only Data Synthesis for Vision Language Model Training☆23Updated 7 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆86Updated 6 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆33Updated last year
- ☆61Updated 3 weeks ago
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation☆31Updated 3 weeks ago
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision☆205Updated this week
- Quick Long Video Understanding [TMLR2025]☆74Updated 3 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆48Updated last week
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆114Updated 6 months ago