MAGIC-AI4Med/MedRBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MAGIC-AI4Med/MedRBench)

MAGIC-AI4Med / MedRBench

[Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".

☆69

Alternatives and similar repositories for MedRBench

Users that are interested in MedRBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MAGIC-AI4Med / ChestX-Reasoner
View on GitHub
☆39Mar 19, 2026Updated 4 months ago
MAGIC-AI4Med / DiagGym
View on GitHub
A virtual clinical environment for self‑evolving LLM diagnostic agents.
☆108Feb 12, 2026Updated 5 months ago
MAGIC-AI4Med / Deep-DxSearch
View on GitHub
An agentic RL framework to enhance retreival-augmented reasoning in Diagnostic Policy
☆103Feb 27, 2026Updated 4 months ago
MAGIC-AI4Med / RadABench
View on GitHub
The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"
☆29Jan 22, 2025Updated last year
MAGIC-AI4Med / SAT
View on GitHub
The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts"
☆10Aug 16, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Medlinker-MG / CSEDB
View on GitHub
CSEDB - Clinical Safety-Effectiveness Dual-Track Benchmark
☆20Aug 13, 2025Updated 11 months ago
uni-medical / GMAI-VL-R1
View on GitHub
☆19Jul 21, 2025Updated last year
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
MAGIC-AI4Med / RaTEScore
View on GitHub
[EMNLP 2024] RaTEScore: A Metric for Radiology Report Generation
☆67May 18, 2025Updated last year
MAGIC-AI4Med / RP3D-Diag
View on GitHub
Code implementation of RP3D-Diag
☆17Nov 25, 2024Updated last year
AQ-MedAI / LiveClin
View on GitHub
LiveClin is a live benchmark designed for the faithful replication of clinical practice
☆16Feb 27, 2026Updated 4 months ago
zhaoziheng / OmniAbnorm-CT
View on GitHub
[CVPR 2026 Findings] Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach
☆25Jun 11, 2026Updated last month
yangyan22 / Medical-Report-Generation-TriNet
View on GitHub
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation
☆18Nov 13, 2025Updated 8 months ago
paulhager / MIMIC-Clinical-Decision-Making-Dataset
View on GitHub
Code repository to create the MIMIC-CDM Dataset.
☆48Feb 7, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BlueZeros / AgentEHR
View on GitHub
Agentic System, Tool Use, Electronic Health Record, Large Language Models, Clinical Nature Language Processing
☆24Apr 13, 2026Updated 3 months ago
MAGIC-AI4Med / MedS-Ins
View on GitHub
[npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"
☆79May 5, 2025Updated last year
MAGIC-AI4Med / EHR-R1
View on GitHub
☆37May 18, 2026Updated 2 months ago
MAGIC-AI4Med / M3Builder
View on GitHub
The official codes for "M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging"
☆45Jul 28, 2025Updated 11 months ago
xiaoman-zhang / KAD
View on GitHub
☆158Aug 29, 2024Updated last year
SUSTechBruce / Med-UniC
View on GitHub
official implementation of "Med-Unic: unifying cross-lingual medical vision-language pre-training by diminishing bias"
☆18Sep 22, 2023Updated 2 years ago
qiaoyu-zheng / RP3D-Diag
View on GitHub
Code implementation of RP3D-Diag
☆79Aug 29, 2025Updated 10 months ago
MAGIC-AI4Med / KEP
View on GitHub
[ECCV 2024 Oral] Knowledge-enhanced pretraining for computational pathology
☆50Apr 17, 2026Updated 3 months ago
MediaBrain-SJTU / FedGELA
View on GitHub
[NeurIPS 2023]Federated Learning with Bilateral Curation for Partially Class-Disjoint Data
☆14Aug 1, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gersteinlab / MedicalAgentsBench
View on GitHub
[Patterns] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
☆83Mar 10, 2026Updated 4 months ago
guanjinquan / CXRTrek
View on GitHub
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight
☆13May 26, 2025Updated last year
ncbi-nlp / Clinical-Tool-Learning
View on GitHub
☆27Aug 10, 2025Updated 11 months ago
eth-medical-ai-lab / Med-PRM
View on GitHub
[EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards
☆68Sep 15, 2025Updated 10 months ago
chaoyi-wu / GPT-4V_Medical_Evaluation
View on GitHub
☆44Oct 20, 2023Updated 2 years ago
ljy19970415 / UniBrain
View on GitHub
An offcial implementation for UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training
☆39Mar 10, 2025Updated last year
MAGIC-AI4Med / MMedLM
View on GitHub
[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"
☆284May 9, 2025Updated last year
TsinghuaC3I / MedXpertQA
View on GitHub
[ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
☆170Jul 17, 2025Updated last year
canyuchen / ClinicalBench
View on GitHub
Code for the KDD'26 paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"
☆35Jun 29, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
PerceptionComputingLab / MedFILIP
View on GitHub
[IEEE-JBHI 2025] Pytorch implementation of the paper "MedFILIP: Medical Fine-Grained Language-Image Pre-Training s"
☆26Jan 18, 2025Updated last year
SPIRAL-MED / DiagnosisArena
View on GitHub
☆33Jun 26, 2026Updated 3 weeks ago
wbw520 / DiReCT
View on GitHub
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)
☆24Mar 6, 2025Updated last year
ljy19970415 / AutoRG-Brain
View on GitHub
The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI".
☆59Jan 6, 2026Updated 6 months ago
UCSC-VLAA / MedReason
View on GitHub
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
☆280Jun 19, 2025Updated last year
MAGIC-AI4Med / DeepRare
View on GitHub
Code implementation of DeepRare (Nature 2026)
☆271Apr 14, 2026Updated 3 months ago