AuroraZengfh / RobustMerge
[NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness
☆56 · Updated last month
Alternatives and similar repositories for RobustMerge
Users who are interested in RobustMerge are comparing it to the repositories listed below.
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25] ☆179 · Updated 7 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆129 · Updated 6 months ago
- Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language" ☆123 · Updated last month
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆104 · Updated 4 months ago
- ☆110 · Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆109 · Updated 8 months ago
- GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆88 · Updated 4 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO ☆74 · Updated 3 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time ☆89 · Updated 7 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆69 · Updated last year
- Code for Heima ☆59 · Updated 9 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis. ☆174 · Updated 4 months ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture ☆212 · Updated last year
- Official Repository of LatentSeek ☆76 · Updated 7 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆84 · Updated 3 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆89 · Updated 11 months ago
- ☆107 · Updated 7 months ago
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆37 · Updated last year
- [ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models ☆94 · Updated last year
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward ☆91 · Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 · Updated last month
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆53 · Updated 4 months ago
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling." ☆145 · Updated 6 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆51 · Updated last year
- [arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆90 · Updated last year
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ☆44 · Updated 8 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models ☆149 · Updated 3 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 · Updated 8 months ago
- Research works from Tencent AI Lab regarding self-evolving agents ☆81 · Updated this week
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs ☆59 · Updated 5 months ago