weijiawu / Awesome-Visual-Reinforcement-LearningLinks

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

☆339

Alternatives and similar repositories for Awesome-Visual-Reinforcement-Learning

Users that are interested in Awesome-Visual-Reinforcement-Learning are comparing it to the libraries listed below

Sorting:

mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM
A paper list for spatial reasoning
☆411Updated last week
cambrian-mllm / cambrian-s
Cambrian-S: Towards Spatial Supersensing in Video
☆375Updated 2 weeks ago
UMass-Embodied-AGI / Mirage
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)
☆194Updated 3 months ago
tulerfeng / Awesome-Embodied-Multimodal-LLMs
Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).
☆121Updated last year
mit-han-lab / vila-u
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
☆410Updated 7 months ago
PzySeere / MetaSpatial
MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …
☆193Updated 6 months ago
TIGER-AI-Lab / Pixel-Reasoner
Pixel-Level Reasoning Model trained with RL [NeuIPS25]
☆251Updated 3 weeks ago
vision-x-nyu / thinking-in-space
Official repo and evaluation implementation of VSI-Bench
☆638Updated 3 months ago
Mini-o3 / Mini-o3
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
☆368Updated 2 months ago
rongyaofang / GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆297Updated 2 months ago
aim-uofa / Omni-R1
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
☆92Updated 5 months ago
tanhuajie / Reason-RFT
[NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.
☆233Updated last month
yix8 / VisualPlanning
Visual Planning: Let's Think Only with Images
☆281Updated 6 months ago
Gabesarch / grounded-rl
☆104Updated 4 months ago
NVlabs / Long-RL
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
☆664Updated 2 months ago
alibaba-damo-academy / RynnVLA-002
WorldVLA: Towards Autoregressive Action World Model
☆560Updated this week
aim-uofa / Active-o3
ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
☆75Updated last week
OuyangKun10 / SpaceR
SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning
☆98Updated 4 months ago
EvolvingLMMs-Lab / EgoLife
[CVPR 2025] EgoLife: Towards Egocentric Life Assistant
☆348Updated 8 months ago
Video-R1 / Awesome-Multimodal-Reasoning
Collections of Papers and Projects for Multimodal Reasoning.
☆105Updated 7 months ago
M-E-AGI-Lab / Awesome-World-Models
Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.
☆55Updated last month
dvlab-research / VisionReasoner
Vision Manus: Your versatile Visual AI assistant
☆300Updated last month
ML-GSAI / LLaDA-V
☆283Updated last month
mll-lab-nu / TStar
TStar is a unified temporal search framework for long-form video question answering
☆71Updated 2 months ago
dongyh20 / Insight-V
[CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
☆230Updated 3 weeks ago
TencentARC / TokLIP
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
☆231Updated 3 months ago
linkangheng / PR1
[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning
☆270Updated 4 months ago
Purshow / Awesome-Unified-Multimodal
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
☆331Updated last month
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆103Updated 3 weeks ago
yunlong10 / Awesome-Video-LMM-Post-Training
🔥🔥🔥 Latest Papers, Codes and Datasets on Video-LMM Post-Training
☆181Updated last week