HJYao00/R1-ShareVL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HJYao00/R1-ShareVL)

HJYao00 / R1-ShareVL

[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward

☆38

Alternatives and similar repositories for R1-ShareVL

Users that are interested in R1-ShareVL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HJYao00 / MM-DeepResearch
View on GitHub
MLLM, DeepResearch, Agentic AI
☆18Jun 1, 2026Updated last month
w-yibo / VTC-R1
View on GitHub
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning.
☆26Feb 20, 2026Updated 5 months ago
HJYao00 / MMReason
View on GitHub
[ICCV 2025] MMReason, MLLMs, step by step, reasoning benchmark, AGI
☆15Apr 25, 2026Updated 2 months ago
HJYao00 / Awesome-Agentic-MLLMs
View on GitHub
Agentic MLLMs
☆216Oct 24, 2025Updated 8 months ago
w-yibo / R1-Compress
View on GitHub
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17Jan 24, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Qsingle / open-medical-r1
View on GitHub
This repository is aim to reproduce the R1-Zero on medical domain.
☆32Jun 11, 2025Updated last year
LHL3341 / ContextBLIP
View on GitHub
ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]
☆11May 17, 2024Updated 2 years ago
GAIR-NLP / Med
View on GitHub
[ICML 2026] What Does Vision Tool-Use Reinforcement Learning Really Learn? Disentangling Tool-Induced and Intrinsic Effects for Crop-and-…
☆21May 15, 2026Updated 2 months ago
TIGER-AI-Lab / VL-Rethinker
View on GitHub
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆189Jun 5, 2025Updated last year
WenjinHou / Uni-OPD
View on GitHub
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe
☆50Jun 10, 2026Updated last month
Moenupa / VTCBench
View on GitHub
Code and data for VTCBench, a VLM benchmark for long-context understanding capabilities under vision-text compression paradigm.
☆27Mar 16, 2026Updated 4 months ago
Leevan001 / MedReason-R1
View on GitHub
MEDREASON-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom
☆16Oct 10, 2025Updated 9 months ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
UCSB-AI / GRIT
View on GitHub
Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"
☆191Jan 16, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ritaranx / AceSearcher
View on GitHub
This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…
☆25Sep 29, 2025Updated 9 months ago
bruno686 / VisPlay
View on GitHub
[CVPR'26] VisPlay: Self-Evolving Vision-Language Models
☆63Feb 25, 2026Updated 4 months ago
ModalMinds / gym-v
View on GitHub
A unified framework for vision-language environments with Gymnasium-compatible interface
☆35Mar 17, 2026Updated 4 months ago
jdh-algo / Citrus-V
View on GitHub
Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
☆24Sep 26, 2025Updated 9 months ago
maifoundations / Visionary-R1
View on GitHub
Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning
☆44Jul 2, 2025Updated last year
CUMTGG / CIIC
View on GitHub
☆18Sep 13, 2023Updated 2 years ago
FereshteShakeri / few-shot-MedVLMs
View on GitHub
☆33Oct 6, 2024Updated last year
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated 2 years ago
louieworth / trd
View on GitHub
Official Implementation of Trajectory-Refined Distillation
☆26Jun 9, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
spatialdatasciencegroup / HST
View on GitHub
[NeurIPS '23] Official code of "A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space"
☆14Jul 13, 2025Updated last year
guanjinquan / CXRTrek
View on GitHub
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight
☆13May 26, 2025Updated last year
pranoyr / scene-graph-vit
View on GitHub
Implementation of the Paper Scene-Graph ViT
☆10Dec 20, 2024Updated last year
YihongDong / RL-PLUS
View on GitHub
☆27Aug 31, 2025Updated 10 months ago
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 9 months ago
lanxiang1017 / DynamicBadPairMining_ICLR24
View on GitHub
DBPM is a simple algorithm designed as a lightweight plug-in without learnable parameters to enhance the performance of time series contr…
☆15Mar 8, 2024Updated 2 years ago
huaixuheqing / VPPO-RL
View on GitHub
[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
☆69Apr 3, 2026Updated 3 months ago
google-research-datasets / maxm
View on GitHub
MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…
☆13Jan 16, 2024Updated 2 years ago
CnFaker / LLaVA-SP
View on GitHub
[ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".
☆24Oct 28, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Gabesarch / grounded-rl
View on GitHub
☆132Jul 22, 2025Updated 11 months ago
rajpurkarlab / ReXKG
View on GitHub
☆17Sep 23, 2024Updated last year
innovator-zero / SAK
View on GitHub
[ICLR2025] Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning
☆14Apr 8, 2025Updated last year
gyhdog99 / RACRO2
View on GitHub
Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)
☆19Jul 1, 2025Updated last year
PKU-YuanGroup / Look-Back
View on GitHub
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆99Jul 10, 2025Updated last year
agents-x-project / PyVision-RL
View on GitHub
[ICML 2026] Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."
☆69Feb 25, 2026Updated 4 months ago
dyhBUPT / YYDS
View on GitHub
YYDS: Visible-Infrared Person Re-Identification with Coarse Description
☆23Jun 12, 2024Updated 2 years ago