openlifescience-ai / Open-Medical-Reasoning-Tasks
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
☆116Updated 6 months ago
Alternatives and similar repositories for Open-Medical-Reasoning-Tasks:
Users that are interested in Open-Medical-Reasoning-Tasks are comparing it to the libraries listed below
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆94Updated last year
- Agent benchmark for medical diagnosis☆173Updated 3 months ago
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code.☆49Updated 9 months ago
- Clinical text summarization by adapting large language models☆138Updated 8 months ago
- ☆115Updated 7 months ago
- Automating enterprise workflows with multimodal agents☆103Updated 5 months ago
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆61Updated last month
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆170Updated last year
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- ☆53Updated last year
- ☆89Updated last month
- ☆241Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆76Updated 6 months ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours.☆21Updated 3 weeks ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆88Updated 3 months ago
- Towards Medical Small Language Models with Self-Evolved \\ Slow Thinking☆66Updated 2 months ago
- Code for the MedRAG toolkit☆296Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆76Updated 6 months ago
- ☆50Updated last year
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆38Updated 6 months ago
- Biomedical Question Answering Datasets.☆99Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆107Updated last month
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆92Updated 5 months ago
- ☆107Updated 2 weeks ago
- ☆143Updated 8 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆74Updated 2 weeks ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆46Updated 5 months ago
- Function Calling Benchmark & Testing☆85Updated 8 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆65Updated 9 months ago