zhangquanchen / VisRLLinks
[ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
☆38Updated 2 months ago
Alternatives and similar repositories for VisRL
Users that are interested in VisRL are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆111Updated last month
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆47Updated last month
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆49Updated 2 months ago
- Official code for paper "GRIT: Teaching MLLMs to Think with Images"☆121Updated 3 weeks ago
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆62Updated 3 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆190Updated last month
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆97Updated this week
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models