OpenSparseLLMs / Open-Pandora
Open-Pandora: On-the-fly Control Video Generation
☆35 Updated last year
Alternatives and similar repositories for Open-Pandora
Users who are interested in Open-Pandora are comparing it to the repositories listed below.
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training ☆91 Updated last year
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding ☆59 Updated last year
- ☆118 Updated 4 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward ☆91 Updated 5 months ago
- ☆201 Updated last month
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval. ☆71 Updated 5 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆109 Updated 8 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆71 Updated 6 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning ☆95 Updated 8 months ago
- ☆63 Updated 6 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆43 Updated last year
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation" ☆28 Updated 7 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆164 Updated 4 months ago
- ☆39 Updated 6 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints ☆80 Updated 6 months ago
- Multimodal RewardBench ☆60 Updated 11 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme ☆147 Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆89 Updated 11 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆131 Updated 9 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆178 Updated 7 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆172 Updated 2 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆64 Updated 3 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆62 Updated 8 months ago
- ☆134 Updated last week
- ☆73 Updated 6 months ago
- Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe ☆140 Updated last month
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆54 Updated last month
- Large Language Models Can Self-Improve in Long-context Reasoning ☆72 Updated last year
- ☆110 Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆51 Updated 6 months ago