jingyi0000 / R1-VLLinks
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
☆432Updated 3 weeks ago
Alternatives and similar repositories for R1-VL
Users that are interested in R1-VL are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs☆128Updated 5 months ago
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection☆122Updated last month
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"