DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
☆91Jan 23, 2026Updated 5 months ago
Alternatives and similar repositories for DoctorAgent-RL
Users that are interested in DoctorAgent-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆75Jun 5, 2025Updated last year
- MC-CoT implementation code☆23Jun 24, 2025Updated last year
- ☆11Nov 3, 2024Updated last year
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated last year
- ☆38Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Nov 29, 2022Updated 3 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- MAM: ModularMulti-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration☆52Apr 3, 2026Updated 3 months ago
- ☆16May 23, 2025Updated last year
- ☆14May 22, 2023Updated 3 years ago
- This is the official code of DeepSearch [ICLR 2026]☆33Oct 22, 2025Updated 8 months ago
- The original code for paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"☆50Apr 24, 2025Updated last year
- ☆15Dec 9, 2022Updated 3 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆21Mar 5, 2025Updated last year
- Code for "Neural Speed Reading with Structural-Jump-LSTM" ICLR 2019☆25Feb 22, 2019Updated 7 years ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆48Apr 21, 2025Updated last year
- Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)☆21Nov 21, 2024Updated last year
- ☆25Oct 29, 2024Updated last year
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆24May 6, 2026Updated last month
- ☆12Apr 8, 2021Updated 5 years ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆68Sep 15, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A flexible and modular learning platform for medical images☆27May 11, 2026Updated last month
- [ECCV 2022] "Adversarial Contrastive Learning via Asymmetric InfoNCE"☆24Dec 12, 2022Updated 3 years ago
- ☆47Mar 14, 2025Updated last year
- ☆11Apr 12, 2024Updated 2 years ago
- Edge-oriented Point cloud Transformer for 3D Intracranial Aneurysm Segmentation. MICCAI22☆13Aug 18, 2022Updated 3 years ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆27Apr 24, 2025Updated last year
- memory-efficient fine-tuning; support 24G GPU memory fine-tuning 7B☆21May 26, 2024Updated 2 years ago
- The official codebase for "Experiential Reinforcement Learning" - https://arxiv.org/pdf/2602.13949v1☆72May 8, 2026Updated last month
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Open-source datasets for paper "Fairness in Graph Mining: A Survey".☆19Nov 3, 2022Updated 3 years ago
- Official code for 'One-Shot Object Localization in Medical Images based on Relative Position Regression'.☆13Sep 10, 2022Updated 3 years ago
- ☆42Jan 26, 2025Updated last year
- The implementation for the Recsys paper: Towards Empathetic Conversational Recommender System☆26Sep 3, 2024Updated last year
- This is the official implementation of the paper "Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness,"…☆19Jul 15, 2024Updated last year
- An implementation for Generator Versus Segmentor: Pseudo-healthy Synthesis☆12Oct 22, 2021Updated 4 years ago
- Official Implementation of "D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion"☆23Oct 29, 2023Updated 2 years ago