FreedomIntelligence / Awesome-LLM-Patient-SimulatorsLinks
A Paper collection for LLM based Patient Simulators
☆91Updated last month
Alternatives and similar repositories for Awesome-LLM-Patient-Simulators
Users that are interested in Awesome-LLM-Patient-Simulators are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆79Updated last month
- Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making☆233Updated last year
- [NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"☆26Updated 2 months ago
- [ICML 2025] MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding☆142Updated 6 months ago
- ☆130Updated last year
- ☆36Updated last year
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆73Updated 4 months ago
- EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images (NeurIPS 2023 D&B)☆91Updated this week
- Official repository of the MIRAGE benchmark☆192Updated this week
- Clinical NLP Shared Task @ NAACL'24☆40Updated 5 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆121Updated last year
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆76Updated 9 months ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21Updated last year
- MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024☆40Updated 2 months ago
- Benchmark, Toolbox, and Reflection-based Method for Clinical Agent☆17Updated last year
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆48Updated 7 months ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆95Updated this week
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Updated last month
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Updated last year
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Updated 2 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆23Updated last year
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Updated last year
- [Nature Communications] The official code for "Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases".☆56Updated 3 months ago
- The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"☆43Updated 4 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆256Updated 7 months ago
- AMEGA-LLM: Autonomous Medical Evaluation for Guideline Adherence of Large Language Models☆24Updated 2 weeks ago
- DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue☆64Updated 2 weeks ago
- ☆48Updated 11 months ago
- ☆29Updated 2 years ago
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆26Updated last year