JarvisUSTC / DoctorAgent-RL
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
☆38Updated last month
Alternatives and similar repositories for DoctorAgent-RL
Users that are interested in DoctorAgent-RL are comparing it to the libraries listed below
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆128Updated 4 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆44Updated 7 months ago
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆139Updated last month
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆54Updated 4 months ago
- [CVPR'25] Interleaved-Modal Chain-of-Thought☆92Updated 2 weeks ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆118Updated 2 months ago
- This repository aims to reproduce R1-Zero in the medical domain.☆31Updated 5 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆39Updated 2 months ago
- Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated 3 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆59Updated 5 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆168Updated last month
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆82Updated 3 weeks ago
- ☆109Updated 2 months ago
- [EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning☆34Updated 10 months ago
- MokA: Multimodal Low-Rank Adaptation for MLLMs☆50Updated 4 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆94Updated 3 weeks ago
- MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration☆22Updated 4 months ago
- ☆107Updated 7 months ago
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆66Updated 3 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆88Updated 11 months ago
- ☆84Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Updated 6 months ago
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆31Updated last year
- Benchmark, Toolbox, and Reflection-based Method for Clinical Agent☆12Updated last year
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Updated 5 months ago
- MC-CoT implementation code☆20Updated 4 months ago
- ☆39Updated 10 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆52Updated 8 months ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆77Updated 11 months ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆30Updated 3 months ago