Open-Reasoner-Zero/Open-Vision-Reasoner

[NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".

☆157

Alternatives and similar repositories for Open-Vision-Reasoner

Users who are interested in Open-Vision-Reasoner are comparing it to the repositories listed below.

linkangheng / Video-UTR
[ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs
☆61 · Feb 27, 2025 · Updated last year
linkangheng / PR1
[NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning
☆289 · Jul 15, 2025 · Updated last year
stepfun-ai / Step3-VL-10B
Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its…
☆409 · Jan 21, 2026 · Updated 6 months ago
WayneMao / RoboMatrix
The Official Implementation of RoboMatrix
☆108 · May 19, 2025 · Updated last year
yuangpeng / dreambench_plus
[ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
☆138 · Feb 23, 2025 · Updated last year
Ahnsun / merlin
[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds
☆97 · Jul 4, 2024 · Updated 2 years ago
stepfun-ai / NextStep-1
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆690 · Feb 27, 2026 · Updated 4 months ago
WayneMao / PillarNeSt
The Official Implementation of PillarNeSt
☆54 · May 19, 2025 · Updated last year
ASTRAL-Group / AlphaOne
[EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
☆89 · Jun 10, 2025 · Updated last year
micky-li-hd / CoCo
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
☆54 · Apr 9, 2026 · Updated 3 months ago
stepfun-ai / Step3
☆454 · Aug 10, 2025 · Updated 11 months ago
stepfun-ai / GEBench
☆53 · Feb 25, 2026 · Updated 5 months ago
Ucas-HaoranWei / Vary-tiny-600k
Vary-tiny codebase built upon LAVIS (for training from scratch) and PDF image-text pair data (about 600k pairs, including English and Chinese)
☆89 · Sep 21, 2024 · Updated last year
stepfun-ai / SteptronOss
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular…
☆578 · May 18, 2026 · Updated 2 months ago
DwanZhang-AI / SePPO
Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."
☆18 · Oct 7, 2024 · Updated last year
jihaonew / MM-Instruct
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment
☆35 · Jul 1, 2024 · Updated 2 years ago
CSfufu / Revisual-R1
[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, mul…
☆212 · Dec 10, 2025 · Updated 7 months ago
NVlabs / Long-RL
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
☆726 · Sep 24, 2025 · Updated 10 months ago
qizekun / OmniSpatial
[ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
☆88 · Jan 21, 2026 · Updated 6 months ago
yunfeixie233 / ViGaL
☆70 · Feb 4, 2026 · Updated 5 months ago
suu990901 / KlearReasoner
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
☆82 · Dec 25, 2025 · Updated 7 months ago
agents-x-project / TIR-Bench
[ECCV 2026] Official implementation of "TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning"
☆25 · Feb 8, 2026 · Updated 5 months ago
real-absolute-AI / NoisyRollout
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆112 · Sep 18, 2025 · Updated 10 months ago
stepfun-ai / PaCoRe
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
☆336 · Feb 5, 2026 · Updated 5 months ago
UniPat-AI / BabyVision
We introduce BabyVision, a benchmark revealing the infancy of AI vision.
☆232 · Jan 13, 2026 · Updated 6 months ago
Visual-Agent / DeepEyes
☆1,250 · Nov 20, 2025 · Updated 8 months ago
Ucas-HaoranWei / Slow-Perception
Official code implementation of Slow Perception: Let's Perceive Geometric Figures Step-by-step
☆163 · Jul 28, 2025 · Updated 11 months ago
TIGER-AI-Lab / Pixel-Reasoner
Pixel-Level Reasoning Model trained with RL [NeurIPS25]
☆301 · Jun 4, 2026 · Updated last month
tulerfeng / Video-R1
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
☆882 · Dec 14, 2025 · Updated 7 months ago
xinyan-cxy / MINT-CoT
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107 · Sep 19, 2025 · Updated 10 months ago
MiniMax-AI / One-RL-to-See-Them-All
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
☆329 · May 31, 2025 · Updated last year
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆190 · Jun 5, 2025 · Updated last year
RunpeiDong / DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆462 · Dec 2, 2024 · Updated last year
stepfun-ai / Step-3.5-Flash
Fast, Sharp & Reliable Agentic Intelligence
☆2,094 · Apr 3, 2026 · Updated 3 months ago
zhaochen0110 / OpenThinkIMG
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆399 · Jun 1, 2025 · Updated last year
penghao-wu / visual_jigsaw
☆78 · Apr 9, 2026 · Updated 3 months ago
TencentARC / ARC-Hunyuan-Video-7B
Structured Video Comprehension of Real-World Shorts
☆239 · Sep 21, 2025 · Updated 10 months ago
stepfun-ai / StepFun-Prover-Preview
Large language models designed for formal theorem proving through tool-integrated reasoning.
☆33 · Aug 13, 2025 · Updated 11 months ago
alibaba-damo-academy / VL-Cogito
☆24 · Nov 4, 2025 · Updated 8 months ago