Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
☆113Jul 7, 2025Updated 8 months ago
Alternatives and similar repositories for Med-R1
Users that are interested in Med-R1 are comparing it to the libraries listed below.
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆43Jun 29, 2025Updated 8 months ago
- DeepTumorVQA benchmark (9262 CT images + 395k QA pairs)☆31Jul 8, 2025Updated 8 months ago
- MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration☆39Jun 25, 2025Updated 9 months ago
- The dataset and evaluation code for MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical found…☆25Feb 19, 2026Updated last month
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆28Mar 18, 2026Updated last week
- 【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding☆32Mar 17, 2026Updated last week
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning☆54Dec 21, 2025Updated 3 months ago
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 4 months ago
- This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…☆68Jun 28, 2025Updated 8 months ago
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆20Oct 19, 2023Updated 2 years ago
- Learning to Use Medical Tools with Multi-modal Agent☆233Mar 18, 2026Updated last week
- [ICCV 2025] MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs☆32Jan 26, 2026Updated last month
- [EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning☆28Jun 12, 2025Updated 9 months ago
- A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets☆219Mar 19, 2025Updated last year
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆142Jul 17, 2025Updated 8 months ago
- ☆14Mar 15, 2025Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- ☆46Nov 12, 2025Updated 4 months ago
- ☆20Apr 5, 2024Updated last year
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆83Sep 19, 2025Updated 6 months ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆19Jan 11, 2026Updated 2 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Feb 21, 2025Updated last year
- ☆22Nov 27, 2025Updated 3 months ago
- Encourages medical LLMs to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 11 months ago
- This repository aims to reproduce R1-Zero in the medical domain.☆32Jun 11, 2025Updated 9 months ago
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated last month
- BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks☆707Jul 8, 2025Updated 8 months ago
- The official code for MedAgent_Pro☆125Aug 26, 2025Updated 6 months ago
- [NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine☆29Mar 10, 2025Updated last year
- ☆11Jun 21, 2025Updated 9 months ago
- [NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …☆71Oct 23, 2025Updated 5 months ago
- Tensorflow implementation of Spatial VAE via Matrix-Variate Normal Distributions☆14May 24, 2017Updated 8 years ago
- PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modal…☆227Dec 6, 2024Updated last year
- [ACL 2025] RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection☆34Jul 23, 2025Updated 8 months ago
- M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models☆424Apr 13, 2025Updated 11 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆54Sep 29, 2025Updated 5 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆23Sep 19, 2024Updated last year
- ☆36Dec 8, 2025Updated 3 months ago
- Code for TIP 2024 paper: Sparse Coding Inspired LSTM and Self-Attention Integration for Medical Image Segmentation☆13Oct 28, 2024Updated last year