[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆53Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for TON
Users that are interested in TON are comparing it to the libraries listed below
Sorting:
- R3: Robust Rubric-Agnostic Reward Models☆20Jul 12, 2025Updated 7 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 2 months ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆23Feb 11, 2026Updated 3 weeks ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 7 months ago
- ☆11Jun 21, 2025Updated 8 months ago
- Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset" by Zhiheng X…☆19Oct 14, 2025Updated 4 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- ☆23Sep 19, 2024Updated last year
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆37Apr 21, 2025Updated 10 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 8 months ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆18Aug 28, 2025Updated 6 months ago
- Official implementation of "PAPR in Motion: Seamless Point-level 3D Scene Interpolation"☆13Nov 6, 2024Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 7 months ago
- 🩻 NV-Reason-CXR-3B is a specialized vision-language model designed for medical reasoning and interpretation of chest X-ray images.☆43Feb 25, 2026Updated last week
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight☆13May 26, 2025Updated 9 months ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆32Nov 25, 2025Updated 3 months ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆27Jan 10, 2026Updated last month
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning☆18Feb 21, 2025Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated last year
- ☆21Jul 21, 2025Updated 7 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated last month
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆60Sep 15, 2025Updated 5 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆97Feb 21, 2025Updated last year
- This repository is aim to reproduce the R1-Zero on medical domain.☆32Jun 11, 2025Updated 8 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 4 months ago
- Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning☆18Sep 26, 2025Updated 5 months ago
- ☆15Sep 23, 2024Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆39Jun 4, 2025Updated 9 months ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…☆25Sep 29, 2025Updated 5 months ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆17Mar 11, 2025Updated 11 months ago
- ☆16Apr 30, 2024Updated last year
- ☆70Jun 18, 2025Updated 8 months ago
- MC-CoT implementation code☆22Jun 24, 2025Updated 8 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆49Updated this week
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆72Jul 10, 2024Updated last year
- Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models☆108Jul 7, 2025Updated 7 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆27Nov 20, 2025Updated 3 months ago
- The repo of ASGMVLP