waltonfuture / MM-UPT
Unsupervised GRPO
☆24 Updated this week
Alternatives and similar repositories for MM-UPT
Users who are interested in MM-UPT are comparing it to the repositories listed below.
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 2 months ago
- ☆19 Updated 3 weeks ago
- Your efficient and accurate answer verification system for RL training. ☆12 Updated this week
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆36 Updated last week
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning ☆44 Updated 2 weeks ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization ☆36 Updated last week
- ☆42 Updated 6 months ago
- ☆14 Updated last month
- ☆16 Updated 10 months ago
- ☆17 Updated 5 months ago
- ☆12 Updated 3 weeks ago
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆51 Updated last week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆43 Updated 3 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" ☆25 Updated 3 weeks ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025] ☆15 Updated 3 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆67 Updated last year
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆64 Updated 2 weeks ago
- Official Repository of LatentSeek ☆30 Updated last week
- ☆16 Updated 4 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆60 Updated 5 months ago
- ☆18 Updated 2 months ago
- Fast-Slow Thinking for Large Vision-Language Model Reasoning ☆14 Updated last month
- ☆19 Updated 3 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆15 Updated last month
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆21 Updated 5 months ago
- ☆22 Updated 10 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆24 Updated 8 months ago
- PyTorch implementation of StableMask (ICML'24) ☆13 Updated 11 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆25 Updated last month
- ☆45 Updated 3 months ago