GaryGuTC/LaPA_model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GaryGuTC/LaPA_model)

GaryGuTC / LaPA_model

[CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering

☆27

Alternatives and similar repositories for LaPA_model

Users that are interested in LaPA_model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Awenbocc / CPCR
View on GitHub
☆15Mar 11, 2023Updated 3 years ago
pengfeiliHEU / MUMC
View on GitHub
This repository is made for the paper: Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medica…
☆48Jul 10, 2024Updated 2 years ago
ecoxial2007 / FGRW_MedVQA
View on GitHub
Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question
☆11Jul 18, 2024Updated 2 years ago
vuhoangminh / vqa_medical
View on GitHub
☆10Oct 20, 2022Updated 3 years ago
thomaswei-cn / MC-CoT
View on GitHub
MC-CoT implementation code
☆23Jun 24, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Holipori / Medical-CXR-VQA
View on GitHub
☆46Jan 21, 2025Updated last year
haifangong / CMSA-MTPT-4-MedicalVQA
View on GitHub
[ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention
☆34Dec 15, 2022Updated 3 years ago
Holipori / EKAID
View on GitHub
code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
☆29May 30, 2025Updated last year
liubo105 / SAT
View on GitHub
Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage
☆11Jun 25, 2023Updated 3 years ago
LX-doctorAI1 / GSKET
View on GitHub
☆35Nov 22, 2022Updated 3 years ago
xiaoxing2001 / DeGLA
View on GitHub
[ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]
☆16Jul 15, 2025Updated last year
Snowinbio / LLNM-Net
View on GitHub
Thyroid Multimodal Deep Learning Net, the Multimodal Deep Learning based Transformer for Thyroid Cancer Latral Lymph Node Metastasis Risk…
☆17Oct 20, 2025Updated 9 months ago
deepglint / UniDoc-RL
View on GitHub
UniDoc-RL: Unified Document Understanding with Reinforcement Learning
☆16May 21, 2026Updated 2 months ago
alibabadoufu / dynamic_fusion_reimplementation
View on GitHub
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17Oct 30, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Sliver-g / Cardiac-CLIP
View on GitHub
☆28Jan 22, 2026Updated 6 months ago
ytaek-oh / fsc-clip
View on GitHub
[EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
☆22Oct 8, 2024Updated last year
LX-doctorAI1 / M2KT
View on GitHub
☆40Mar 15, 2023Updated 3 years ago
tossowski / MultimodalPromptRetrieval
View on GitHub
☆16Feb 5, 2024Updated 2 years ago
dhruvsharma15 / MEDVQA
View on GitHub
☆10Aug 31, 2021Updated 4 years ago
ssyze / EVE
View on GitHub
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
☆10Mar 1, 2024Updated 2 years ago
tjvsonsbeek / open-ended-medical-vqa
View on GitHub
Repository for the paper: Open-Ended Medical Visual Question Answering Through Prefix Tuning of Language Models (https://arxiv.org/abs/23…
☆19Sep 2, 2023Updated 2 years ago
chenzcv7 / MOTOR
View on GitHub
☆21May 4, 2023Updated 3 years ago
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
LijunRio / AG-KD
View on GitHub
This repository contains the code for our paper: Enhancing Abnormality Grounding for Vision-Language Models with Knowledge Descriptions
☆19Jun 24, 2025Updated last year
anxiangsir / Video_Benchmark_Suite
View on GitHub
Video Benchmark Suite: Rapid Evaluation of Video Foundation Models
☆17Jan 10, 2025Updated last year
aioz-ai / MICCAI21_MMQ
View on GitHub
Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)
☆37Apr 21, 2026Updated 3 months ago
wjhou / ORGan
View on GitHub
[ACL 2023] ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning
☆55Oct 3, 2024Updated last year
phymhan / S2D2
View on GitHub
☆16Jun 17, 2026Updated last month
Holipori / MIMIC-Diff-VQA
View on GitHub
☆73Feb 3, 2025Updated last year
genmilab / MedMO
View on GitHub
MedMO: Medical Foundation Model
☆26Apr 8, 2026Updated 3 months ago
GaryGuTC / COMG_model
View on GitHub
[WACV 2024] Complex Organ Mask Guided Radiology Report Generation
☆43Nov 10, 2025Updated 8 months ago
JXLiu-AI / MedCoT
View on GitHub
☆42Dec 8, 2025Updated 7 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zhjohnchan / PTUnifier
View on GitHub
[ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
☆78Mar 22, 2024Updated 2 years ago
mahmoodlab / multimodal-cancer-origin-prediction
View on GitHub
Deep learning-based multimodal integration of histology and genomics to improves cancer origin prediction
☆27Mar 28, 2023Updated 3 years ago
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆19Aug 21, 2025Updated 11 months ago
HappyEureka / mcrafter
View on GitHub
multi-agent crafter for cooperative tasks
☆14Aug 2, 2025Updated 11 months ago
deepglint / Victor
View on GitHub
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
☆29Aug 15, 2025Updated 11 months ago
maifoundations / Visionary-R1
View on GitHub
Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning
☆44Jul 2, 2025Updated last year
OpenMICG / FAVP
View on GitHub
☆16Sep 17, 2025Updated 10 months ago