Teddy-XiongGZ / MIRAGELinks
Official repository of the MIRAGE benchmark
☆169Updated 11 months ago
Alternatives and similar repositories for MIRAGE
Users that are interested in MIRAGE are comparing it to the libraries listed below
Sorting:
- Code for the MedRAG toolkit☆433Updated 4 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆73Updated 2 weeks ago
- [ACL 2024 Findings] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning https://arxiv.org/abs/2311.10537☆287Updated last year
- Code and data for MedQA☆312Updated 2 years ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆196Updated 11 months ago
- The paper list of the review on LLMs in medicine - "Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assis…☆252Updated last year
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆109Updated 9 months ago
- ☆126Updated last year
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆92Updated last year
- A curated list of popular Datasets, Models and Papers for LLMs in Medical/Healthcare☆284Updated last year
- Biomedical Question Answering Datasets.☆114Updated 5 months ago
- [ISMB 2024] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆64Updated last year
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆195Updated 10 months ago
- Official repository for RAG-Gym☆113Updated 7 months ago
- A large-scale (194k), Multiple-Choice Question Answering (MCQA) dataset designed to address realworld medical entrance exam questions.☆234Updated 2 years ago
- [Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"☆269Updated 4 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆222Updated 3 months ago
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆72Updated 5 months ago
- A Chinese National Medical Licensing Examination dataset and large languge model benchmarks☆74Updated last year
- Clinical NLP Shared Task @ NAACL'24☆35Updated last month
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆208Updated last year
- ☆66Updated 8 months ago
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆163Updated 2 weeks ago
- A Paper collection for LLM based Patient Simulators☆59Updated 3 weeks ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆28Updated last year
- [ACL Oral 2025] The official GitHub repository for TC-RAG (Turing-Complete RAG)☆68Updated 7 months ago
- Agent benchmark for medical diagnosis☆234Updated 9 months ago
- ☆91Updated 7 months ago
- This repository contains the code for our paper "Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering" [EMNLP…☆13Updated 11 months ago
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆42Updated 2 months ago