abachaa / MEDEC
☆34Updated 4 months ago
Alternatives and similar repositories for MEDEC:
Users that are interested in MEDEC are comparing it to the libraries listed below
- ☆48Updated 2 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆37Updated 2 weeks ago
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.☆48Updated 10 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆25Updated 3 weeks ago
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆82Updated 7 months ago
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆132Updated last month
- [npj digital medicine] The official codes for "Towards Evaluating and Building Versatile Large Language Models for Medicine"☆60Updated this week
- ☆90Updated 3 months ago
- ☆43Updated 7 months ago
- ☆36Updated 3 months ago
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆40Updated 8 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆71Updated last month
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆55Updated 2 weeks ago
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆70Updated 2 months ago
- ☆23Updated 5 months ago
- Agent benchmark for medical diagnosis☆186Updated 4 months ago
- [ISMB '24] Self-BioRAG: Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models☆62Updated last year
- ☆30Updated 6 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆30Updated this week
- Large language model of Medical AI, General Medical AI (GMAI)☆16Updated last year
- A Paper collection for LLM based Patient Simulators☆24Updated 3 weeks ago
- ☆26Updated 3 months ago
- Official repository of the MIRAGE benchmark☆136Updated 6 months ago
- Dataset of paper: On the Compositional Generalization of Multimodal LLMs for Medical Imaging☆32Updated 4 months ago
- Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost☆38Updated last year
- ☆49Updated 2 years ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆93Updated 4 months ago
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆185Updated 6 months ago
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records☆36Updated 4 months ago
- ☆37Updated last year