DCDmllm / HealthGPTLinks

【ICML 2025 Spotlight】 Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’

☆1,588

Alternatives and similar repositories for HealthGPT

Users that are interested in HealthGPT are comparing it to the libraries listed below

Sorting:

ZJUI-AI4H / Hulu-Med
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
☆570Updated 2 months ago
Tuner12 / Shazam
☆37Updated this week
microsoft / DeepVideoDiscovery
**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.
☆346Updated 3 months ago
Yore0 / TTDG-MGM
[CVPR 2025] Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation
☆95Updated 7 months ago
OpenGVLab / ScaleCUA
ScaleCUA is the open-sourced computer use agents that can operate on cross-platform environments (Windows, macOS, Ubuntu, Android).
☆1,065Updated 3 weeks ago
InternScience / InternAgent
When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification
☆841Updated 2 months ago
QuantaAlpha / KnowMeBench
☆110Updated 2 weeks ago
lwpyh / Awesome-MLLM-Reasoning-Collection
A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.
☆562Updated last month
InternScience / SurveyForge
(ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…
☆316Updated 5 months ago
HITsz-TMG / Uni-MoE
Uni-MoE: Lychee's Large Multimodal Model Family.
☆1,074Updated last month
HJYao00 / Mulberry
[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
☆1,238Updated 2 weeks ago
pokerme7777 / Compositional-Visual-Reasoning-Survey
Explain Before You Answer: A Survey on Compositional Visual Reasoning
☆306Updated 3 months ago
easydoc-ai / easydoc
☆1,112Updated 6 months ago
microsoft / MIRA
MIRA: Medical Time Series Foundation Model for Real-World Health Data
☆153Updated last week
yifangao112 / DinoUNet
Official repository for Dino U-Net: Exploiting High-Fidelity Dense Features from Foundation Models for Medical Image Segmentation. (DINOv…
☆263Updated 5 months ago
ShuchangYe-bib / ProLearn
[ICCV 2025] Official Implementation of "ProLearn: Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driv…
☆54Updated 4 months ago
lingxitong / MIL_BASELINE
A library that integrates different MIL methods into a unified framework
☆305Updated last week
YihuaJerry / EventVAD
[MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection
☆518Updated 6 months ago
Xiaoqi-Zhao-DLUT / Spider-UniCDSeg
(ICML 2024) Spider: A Unified Framework for Context-dependent Concept Segmentation
☆353Updated 10 months ago
Everlyn-Labs / ANTRP
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs
☆163Updated 10 months ago
yuisuen / DAW
Official implementation of "DAW: Exploring the Better Weighting Function for Semi-supervised Semantic Segmentation" (NeurIPS 2023)
☆36Updated 11 months ago
FreedomIntelligence / HuatuoGPT-Vision
Medical Multimodal LLMs
☆371Updated 9 months ago
Emo-gml / EmoBench-M
EmoBench-M: A benchmark for evaluating Emotional Intelligence in Multimodal Large Language Models.
☆131Updated this week
DEEP-PolyU / Awesome-LLM-based-Text2SQL
[TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchma…
☆1,255Updated last week
QuantaAlpha / RepoMaster
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…
☆480Updated 3 months ago
guanhaisu / OBSD
Deciphering Oracle Bone Language with Diffusion Models (ACL 2024 Best Paper)
☆225Updated 4 months ago
tulerfeng / OneThinker
🔥 OneThinker: All-in-one Reasoning Model for Image and Video
☆388Updated 3 weeks ago
wcm-wanglab / iBKH
iBKH: The integrative Biomedical Knowledge Hub
☆513Updated 2 weeks ago
WAMAWAMA / WAMA_Modules
A PyTorch Computer Vision (CV) module library for building n-D networks flexibly ~
☆364Updated last year
AlenjandroWang / ASVR
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better
☆186Updated this week