JarvisUSTC / DoctorAgent-RLLinks
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
☆19Updated last month
Alternatives and similar repositories for DoctorAgent-RL
Users that are interested in DoctorAgent-RL are comparing it to the libraries listed below
Sorting:
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆42Updated 4 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆36Updated 3 months ago
- [EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning☆33Updated 8 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆81Updated 3 weeks ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆47Updated 4 months ago
- ☆38Updated 7 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆37Updated 3 months ago
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆103Updated last month
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆51Updated 3 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆62Updated 2 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆83Updated 2 weeks ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆42Updated 2 weeks ago
- CLIP-MoE: Mixture of Experts for CLIP☆46Updated 11 months ago
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆32Updated last year
- ☆41Updated 2 months ago
- This repository is aim to reproduce the R1-Zero on medical domain.☆30Updated 2 months ago
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆113Updated 2 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆132Updated 4 months ago
- ☆38Updated 10 months ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆25Updated 4 months ago
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning☆64Updated 7 months ago
- MC-CoT implementation code☆19Updated 2 months ago
- ☆100Updated 5 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆38Updated 2 months ago
- ☆48Updated 6 months ago
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆70Updated 8 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆70Updated last year
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆55Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆145Updated 2 months ago
- MedEvalKit: A Unified Medical Evaluation Framework☆139Updated last week