ShawnHuang497/BiRD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ShawnHuang497/BiRD)

ShawnHuang497 / BiRD

The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'

☆34

Alternatives and similar repositories for BiRD

Users that are interested in BiRD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShawnHuang497 / RecLMIS
View on GitHub
This repo is the official implementation of "RecLMIS: Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentati…
☆32Jun 19, 2025Updated last year
ASGMVLP / ASGMVLP_CODE
View on GitHub
The repo of ASGMVLP
☆19Jan 16, 2026Updated 6 months ago
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated 2 years ago
zhaoziheng / OmniAbnorm-CT
View on GitHub
[CVPR 2026 Findings] Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach
☆25Jun 11, 2026Updated last month
ShawnHuang497 / MedPLIB
View on GitHub
The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'
☆134Jul 7, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xmed-lab / MedRegA
View on GitHub
[ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
☆46Oct 18, 2025Updated 9 months ago
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
Luffy03 / UniBiomed
View on GitHub
[Nature Communications 2026] A universal foundation model for grounded biomedical image interpretation
☆72Jun 12, 2026Updated last month
UCSC-VLAA / MedTrinity-25M
View on GitHub
[ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations…
☆412Jul 11, 2025Updated last year
StanfordMIMI / RaLEs
View on GitHub
Radiology Language Evaluations
☆11Nov 17, 2023Updated 2 years ago
naamiinepal / medvlsm
View on GitHub
[MIDL 2024] Exploring Transfer Learning in Medical Image Segmentation using Vision-Language Models
☆71Nov 28, 2024Updated last year
sunanhe / MedDr
View on GitHub
A generalist foundation model for healthcare capable of handling diverse medical data modalities.
☆100Apr 30, 2026Updated 2 months ago
yiqingxyq / DocLens
View on GitHub
Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)
☆22May 18, 2024Updated 2 years ago
standardmodelbio / Llama3-Med
View on GitHub
☆32Oct 18, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Tang-xiaoxiao / Medthink
View on GitHub
[ 🎯 NAACL 2025 ] MedThink: A Rationale-Guided Framework for Explaining Medical Visual Question Answering
☆18Jun 15, 2026Updated last month
HKUSTGZ-ML4Health-Lab / Med-Scout
View on GitHub
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
☆16Feb 8, 2026Updated 5 months ago
microsoft / LLaVA-Rad
View on GitHub
Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.
☆58Jan 22, 2026Updated 6 months ago
egeozsoy / ORacle
View on GitHub
Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.
☆25Jan 6, 2025Updated last year
yangyan22 / Medical-Report-Generation-TriNet
View on GitHub
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation
☆18Nov 13, 2025Updated 8 months ago
MAGIC-AI4Med / RadABench
View on GitHub
The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"
☆29Jan 22, 2025Updated last year
xmed-lab / TP-Mamba
View on GitHub
MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images
☆28Apr 3, 2025Updated last year
ibrahimethemhamamci / BTB3D
View on GitHub
[NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging
☆42Nov 4, 2025Updated 8 months ago
rajpurkarlab / ReXrank
View on GitHub
☆28Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
manglu097 / Chiron-o1
View on GitHub
[NIPS 2025] Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative …
☆60Oct 23, 2025Updated 9 months ago
StanfordMIMI / MedVAL
View on GitHub
Toward Expert-Level Medical Text Validation with Language Models
☆18Oct 23, 2025Updated 9 months ago
Guerbet-AI / wsp-contrastive
View on GitHub
Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023
☆11Dec 16, 2025Updated 7 months ago
UCSB-AI / ProbMed
View on GitHub
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆25May 12, 2026Updated 2 months ago
jiangsongtao / Med-MoE
View on GitHub
[EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"
☆158Jul 7, 2025Updated last year
aayushmanace / PatchAlign24
View on GitHub
☆15May 15, 2025Updated last year
AIPMLab / FACMIC
View on GitHub
Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)
☆14Jun 21, 2024Updated 2 years ago
xiaofang007 / ViP
View on GitHub
[MICCAI 2024 Early Accept, Oral] Aligning Medical Images with General Knowledge from Large Language Models
☆28Mar 28, 2025Updated last year
uni-medical / GMAI-VL-R1
View on GitHub
☆19Jul 21, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
function2-llx / MMMM
View on GitHub
[NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
☆31Mar 10, 2025Updated last year
yu-rp / apiprompting
View on GitHub
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
☆112Oct 10, 2024Updated last year
zzma2 / medical-llm-reasoning-survey
View on GitHub
A curated list of medical reasoning research on large language models, organized by modality, technique, application, and benchmark.
☆19Oct 17, 2025Updated 9 months ago
Schuture / Quality-Sentinel
View on GitHub
This is the repository of Quality Sentinel, a label quality evaluation model for medical image segmentation.
☆22Dec 3, 2025Updated 7 months ago
alibaba-damo-academy / MedEvalKit
View on GitHub
MedEvalKit: A Unified Medical Evaluation Framework
☆247Feb 24, 2026Updated 5 months ago
WangRongsheng / Med-R1
View on GitHub
Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.
☆26Apr 24, 2025Updated last year
Project-MONAI / VLM-Radiology-Agent-Framework
View on GitHub
☆220Sep 22, 2025Updated 10 months ago