DATEXIS/AMEGA-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DATEXIS/AMEGA-benchmark)

DATEXIS / AMEGA-benchmark

AMEGA-LLM: Autonomous Medical Evaluation for Guideline Adherence of Large Language Models

☆31

Alternatives and similar repositories for AMEGA-benchmark

Users that are interested in AMEGA-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

koudounasalkis / AI4Voice
View on GitHub
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15Jun 11, 2024Updated 2 years ago
Medlinker-MG / CSEDB
View on GitHub
CSEDB - Clinical Safety-Effectiveness Dual-Track Benchmark
☆20Aug 13, 2025Updated 11 months ago
SPIRAL-MED / DiagnosisArena
View on GitHub
☆33Jun 26, 2026Updated last month
jomoll / onco-agent
View on GitHub
☆19May 8, 2026Updated 2 months ago
Augmented-Nature / OpenFDA-MCP-Server
View on GitHub
☆20Dec 21, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dustn1259 / EHRCon
View on GitHub
Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records
☆31Aug 21, 2024Updated last year
FreedomIntelligence / SepsisAgent
View on GitHub
Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model
☆30May 15, 2026Updated 2 months ago
pqpq17 / Awesome-LLM-Reasoning-on-Medicine
View on GitHub
The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning
☆24Apr 7, 2026Updated 3 months ago
jmandel / kiln
View on GitHub
Baking FHIR from raw clay
☆27Oct 1, 2025Updated 9 months ago
zhao-zy15 / RareArena
View on GitHub
A Comprehensive Rare Disease Diagnostic Dataset with nearly 50,000 patients covering more than 4000 diseases
☆49Mar 13, 2026Updated 4 months ago
zruiii / QwenAudioSFT
View on GitHub
The repoduction codes for Qwen-Audio Fine-tuning
☆55Feb 28, 2026Updated 4 months ago
EmilyAlsentzer / rare-disease-simulation
View on GitHub
Simulate patients with rare genetic conditions
☆24Jul 28, 2023Updated 2 years ago
MIC-DKFZ / image-time-series
View on GitHub
Code for deep learning-based glioma/tumor growth models
☆27Nov 30, 2021Updated 4 years ago
StanfordBDHG / phoenix
View on GitHub
Web-based HL7® FHIR® Questionnaire Builder
☆27Nov 18, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
glee4810 / FHIR-AgentBench
View on GitHub
Code and Data for FHIR-AgentBench
☆26Dec 15, 2025Updated 7 months ago
yandex-research / AsyncReasoning
View on GitHub
☆25Jun 25, 2026Updated last month
kevinwu23 / Stanford-MedCaseReasoning
View on GitHub
☆51Jun 2, 2025Updated last year
mitmedialab / medical_hallucination
View on GitHub
Medical Hallucination in Foundation Models and Their Impact on Healthcare (2025)
☆83Nov 5, 2025Updated 8 months ago
Ildaron / OpenCV-image-preprocessing-python
View on GitHub
Shortest versions of python script for image processing with OpenCV
☆13Jun 16, 2022Updated 4 years ago
kbressem / faimed3d
View on GitHub
Extension to fastai for volumetric medical data
☆33Apr 12, 2023Updated 3 years ago
trevorpfiz / scribeHC
View on GitHub
Open source AI ambient scribe app for healthcare
☆21Jul 8, 2024Updated 2 years ago
mila-iqia / ddxplus
View on GitHub
☆125Aug 4, 2025Updated 11 months ago
StanfordBDHG / ResearchKit
View on GitHub
ResearchKit with Swift Package Manager (SPM), SwiftUI, C++ Interoperability, and visionOS support
☆12Jan 25, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
stellalisy / mediQ
View on GitHub
☆43Jan 26, 2025Updated last year
agsarthak / Goal-oriented-Dialogue-Systems
View on GitHub
Applying Deep Reinforcement Learning for dialogue generation. aka chatbot
☆13Apr 30, 2017Updated 9 years ago
stanfordmlgroup / MedAgentBench
View on GitHub
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents
☆306Nov 21, 2025Updated 8 months ago
NoManNayeem / Langchain_CrewAI_Gemini-AI_Agents
View on GitHub
Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.
☆14Mar 24, 2024Updated 2 years ago
Healthedata1 / mFHIR
View on GitHub
Open mHealth Schema mapping to FHIR resources. This site is published at:
☆12Dec 9, 2022Updated 3 years ago
NathanaelBeau / CodeInsight
View on GitHub
The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap betw…
☆15Oct 22, 2024Updated last year
SamuelSchmidgall / AgentClinic
View on GitHub
Agent benchmark for medical diagnosis
☆338Dec 31, 2024Updated last year
hkm5558 / KMPageControl
View on GitHub
一种常见样式的PageControl 继承于UIPageControl
☆16Nov 23, 2020Updated 5 years ago
dek924 / PatientSim
View on GitHub
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)
☆39Apr 9, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HanjieChen / ChallengeClinicalQA
View on GitHub
Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
☆50Jul 10, 2025Updated last year
byte-genie / examples-genie
View on GitHub
Usage examples for byte-genie API
☆12Apr 27, 2024Updated 2 years ago
atultiwari / LLaVA-Med
View on GitHub
Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities.
☆10Nov 29, 2023Updated 2 years ago
paulhager / MIMIC-Clinical-Decision-Making-Dataset
View on GitHub
Code repository to create the MIMIC-CDM Dataset.
☆48Feb 7, 2025Updated last year
disi-unibo-nlp / medgenie
View on GitHub
The First Generate-then-Read Framework for Multiple-Choice Question Answering in Medicine
☆15May 27, 2024Updated 2 years ago
ozzafar / count_token_optimization
View on GitHub
☆16Sep 6, 2024Updated last year
heqin-zhu / UOD_universal_oneshot_detection
View on GitHub
[MICCAI 2023] (early accept) UOD: universal oneshot detection of anatomical landmarks. https://arxiv.org/abs/2306.07615
☆12Jan 4, 2024Updated 2 years ago