haonan3 / V1
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36 Updated 7 months ago
Alternatives and similar repositories for V1
Users who are interested in V1 are comparing it to the libraries listed below
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization" ☆59 Updated last year
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆81 Updated last month
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation