Osilly / Vision-R1

This is the first paper to explore how to effectively use RL for MLLMs. It introduces Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
346 · Updated this week

Alternatives and similar repositories for Vision-R1:

Users who are interested in Vision-R1 are comparing it to the libraries listed below.