openlifescience-ai / Open-Medical-Reasoning-Tasks
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
☆126Updated 11 months ago
Alternatives and similar repositories for Open-Medical-Reasoning-Tasks
Users who are interested in Open-Medical-Reasoning-Tasks are comparing it to the repositories listed below.
- Agent benchmark for medical diagnosis☆221Updated 7 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records.☆95Updated 3 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆200Updated last year
- ☆91Updated 6 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆72Updated 10 months ago
- Clinical text summarization by adapting large language models☆149Updated last year
- ☆118Updated last year
- ☆55Updated last year
- MIRIAD is a million-scale Medical Instruction and RetrIeval Dataset☆117Updated this week
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆208Updated 2 months ago
- ☆145Updated last year
- Automating enterprise workflows with multimodal agents☆110Updated 10 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆118Updated 6 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆108Updated last year
- A framework for creating grounded instruction-based datasets and training conversational domain expert Large Language Models (LLMs).☆360Updated 2 years ago
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆19Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- ☆31Updated 3 months ago
- ☆50Updated 2 years ago
- Function Calling Benchmark & Testing☆89Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆88Updated 10 months ago
- ☆38Updated 2 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆47Updated 3 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆285Updated 5 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (early accepted)☆80Updated last month
- ☆94Updated 5 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆219Updated 3 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆100Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year