Reinforcement Learning of Vision Language Models with Self Visual Perception Reward
☆161Sep 23, 2025Updated 5 months ago
Alternatives and similar repositories for Vision-SR1
Users that are interested in Vision-SR1 are comparing it to the libraries listed below
Sorting:
- Synthetic Video hallucination and Mitigation☆18Sep 21, 2025Updated 5 months ago
- An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluatio…☆61Jul 18, 2025Updated 7 months ago
- ☆16Jan 30, 2022Updated 4 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- grpo to train long form QA and instructions with long-form reward model