Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
☆48Jul 10, 2025Updated 9 months ago
Alternatives and similar repositories for ChallengeClinicalQA
Users that are interested in ChallengeClinicalQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Dec 21, 2025Updated 4 months ago
- Efficient Approach for Guided Local Examination in Digital Pathology☆35Apr 26, 2026Updated last week
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆155Jul 17, 2025Updated 9 months ago
- H&E ROI-Level and WSI-Level Nuclei Segmentation with HoVer-Net☆10Jul 30, 2024Updated last year
- ☆93Feb 8, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆43Jan 28, 2026Updated 3 months ago
- Dataset of 57 mock medical primary care consultations: audio, consultation notes, human utterance-level transcripts.☆80Nov 16, 2022Updated 3 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems☆10Feb 1, 2023Updated 3 years ago
- Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.☆10Feb 11, 2022Updated 4 years ago
- ☆40Jan 14, 2025Updated last year
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆270Jun 19, 2025Updated 10 months ago
- Evaluating LLMs for medical applications☆15Nov 30, 2023Updated 2 years ago
- Agent benchmark for medical diagnosis☆301Dec 31, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Make machine learning simpler with Galaxy☆12Jul 16, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆25Updated this week
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆200Oct 15, 2024Updated last year
- ☆10Mar 24, 2022Updated 4 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- ☆129May 8, 2024Updated 2 years ago
- 💵 Code for Less is More for Long Document Summary Evaluation by LLMs (Wu*, Iso* et al; EACL 2024)☆11Feb 22, 2024Updated 2 years ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆23Aug 28, 2025Updated 8 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆100Mar 22, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆49Jun 2, 2025Updated 11 months ago
- ☆12Nov 23, 2021Updated 4 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).☆11Dec 28, 2022Updated 3 years ago
- ☆62Mar 9, 2026Updated 2 months ago
- Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…☆55Aug 27, 2025Updated 8 months ago
- ☆48Feb 26, 2025Updated last year
- Large Language Models to Identify Social Determinants of Health in Electronic Health Records | Paper: https://www.nature.com/articles/s41…☆51Jan 11, 2024Updated 2 years ago
- Code for paper Edge-Enhanced Dilated Residual Attention Network for Multimodal Medical Image Fusion☆12Nov 18, 2024Updated last year
- ☆17Jan 31, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Released code for the paper 'End-to-end Multiple Instance Learning for Whole-Slide Cytopathology of Urothelial Carcinoma'☆10Nov 24, 2021Updated 4 years ago
- ☆13Oct 17, 2020Updated 5 years ago
- ☆14May 23, 2022Updated 3 years ago
- Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"☆14Apr 25, 2025Updated last year
- Swift package for seamless audio recording and playback in iOS apps.☆14Sep 27, 2024Updated last year
- ☆115Aug 4, 2025Updated 9 months ago
- code related to submission of "Accurate diagnosis of lymphoma on whole slide histopathology images using deep learning"☆10May 12, 2020Updated 5 years ago