openlifescience-ai / Open-Medical-Reasoning-Tasks
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
☆112 Updated 5 months ago
Alternatives and similar repositories for Open-Medical-Reasoning-Tasks:
Users who are interested in Open-Medical-Reasoning-Tasks are comparing it to the repositories listed below
- Agent benchmark for medical diagnosis ☆161 Updated 2 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval. ☆164 Updated 11 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data. ☆41 Updated 4 months ago
- MedAlign is a clinician-generated dataset for instruction following with electronic medical records. ☆91 Updated last year
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. ☆47 Updated 8 months ago
- Automating enterprise workflows with multimodal agents ☆100 Updated 4 months ago
- Code for the MedRAG toolkit ☆271 Updated last week
- Claude API Test Project ☆87 Updated 10 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated last month
- Clinical text summarization by adapting large language models ☆133 Updated 7 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆224 Updated 2 weeks ago
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance" ☆19 Updated last week
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆104 Updated 3 weeks ago
- Solving data for LLMs - Create quality synthetic datasets! ☆145 Updated last month
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ☆166 Updated 10 months ago
- RAG example using DSPy, Gradio, FastAPI ☆75 Updated 10 months ago
- Simple examples using Argilla tools to build AI ☆53 Updated 3 months ago
- Function Calling Benchmark & Testing ☆83 Updated 7 months ago
- Doing simple retrieval from LLMs at various context lengths to measure accuracy ☆100 Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 7 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning ☆119 Updated last week
- Vector Guideline Comparison for Medical Oncology ☆29 Updated 9 months ago
- This is Clinfo.AI Demo Instruction ☆34 Updated 6 months ago