Official repository of the MIRAGE benchmark
☆197Feb 6, 2026Updated last month
Alternatives and similar repositories for MIRAGE
Users that are interested in MIRAGE are comparing it to the libraries listed below
Sorting:
- Code for the MedRAG toolkit☆519May 8, 2025Updated 10 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆237Mar 24, 2024Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆82Dec 18, 2025Updated 2 months ago
- Code and data for MedQA☆361Dec 1, 2022Updated 3 years ago
- PubMedQA: A Dataset for Biomedical Research Question Answering☆412Apr 18, 2023Updated 2 years ago
- ☆41May 22, 2025Updated 9 months ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Sep 26, 2024Updated last year
- Biomedical Question Answering Datasets.☆124Apr 30, 2025Updated 10 months ago
- ☆37Jan 26, 2025Updated last year
- [ISMB 2024] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆64Apr 4, 2024Updated last year
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆77May 5, 2025Updated 10 months ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- ☆48Feb 26, 2025Updated last year
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)☆91Feb 6, 2026Updated last month
- CMB, A Comprehensive Medical Benchmark in Chinese☆232Mar 27, 2025Updated 11 months ago
- Large language model of Medical AI, General Medical AI (GMAI)☆17Jan 30, 2024Updated 2 years ago
- [Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"☆276May 9, 2025Updated 10 months ago
- ☆103Jun 6, 2024Updated last year
- Official repository for RAG-Gym☆121Mar 4, 2025Updated last year
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆261Nov 28, 2022Updated 3 years ago
- A specialized LLM for study search, study screening, and data extraction from medical literature.☆26Mar 10, 2025Updated 11 months ago
- This repository is aim to reproduce the R1-Zero on medical domain.☆32Jun 11, 2025Updated 8 months ago
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆242Nov 10, 2024Updated last year
- A Python tool to evaluate the performance of VLM on the medical domain.☆83Aug 5, 2025Updated 7 months ago
- The evaluation framework for the InfiCoder-Eval benchmark.☆21Jul 22, 2024Updated last year
- The resources for LMKG (a large-scale, high-quality, multi-source, and multi-lingual medical knowledge graph).☆22Sep 7, 2023Updated 2 years ago
- DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)☆23Mar 6, 2025Updated last year
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆142Jul 17, 2025Updated 7 months ago
- Learning to Use Medical Tools with Multi-modal Agent☆230Feb 7, 2026Updated last month
- ☆20Mar 22, 2024Updated last year
- ☆23Jan 16, 2024Updated 2 years ago
- Joint learning of images and text via maximization of mutual information☆19Dec 14, 2021Updated 4 years ago
- ☆21Aug 19, 2024Updated last year
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Feb 21, 2025Updated last year
- [ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.☆41Sep 19, 2024Updated last year
- ☆52Aug 14, 2024Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆23Sep 19, 2024Updated last year
- ☆25Updated this week
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆24Feb 11, 2026Updated 3 weeks ago