rayruizhiliao/mutual_info_img_txt

Joint learning of images and text via maximization of mutual information

☆19

Alternatives and similar repositories for mutual_info_img_txt

Users that are interested in mutual_info_img_txt are comparing it to the libraries listed below.

rayruizhiliao / joint_chestxray
Joint learning of chest radiographs and radiology reports
☆26 · Dec 30, 2021 · Updated 4 years ago
philip-mueller / lovt
Localized representation learning from Vision and Text (LoVT)
☆33 · Jul 2, 2024 · Updated 2 years ago
ChenXiaoFei-CS / KoBo
Official implementation of MICCAI 2023【Knowledge Boosting: Rethinking Medical Contrastive Vision-Language Pre-training】
☆16 · Mar 19, 2024 · Updated 2 years ago
liubo105 / SAT
Improving Medical Vision-Language Contrastive Pretraining with Semantics-aware Triage
☆11 · Jun 25, 2023 · Updated 3 years ago
marshuang80 / gloria
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition
☆242 · Feb 6, 2023 · Updated 3 years ago
microsoft / Do-You-See-Me
☆13 · Jun 21, 2025 · Updated last year
YtongXie / X-RGen
[ACCV2024 (Oral)] Official PyTorch implementation of X-RGen
☆18 · Jan 20, 2025 · Updated last year
yangzhou12 / BenchX
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
☆48 · Dec 27, 2025 · Updated 6 months ago
IreneZihuiLi / EHRKit-2022
A Python Natural Language Processing Toolkit for Electronic Health Record Texts
☆13 · May 24, 2023 · Updated 3 years ago
chl8856 / DeepIMV
A Variational Information Bottleneck Approach to Multi-Omics Data Integration
☆24 · May 11, 2021 · Updated 5 years ago
UARK-AICV / FG-CXR
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆11 · Jul 28, 2025 · Updated 11 months ago
funnyzhou / REFERS
☆112 · Aug 17, 2022 · Updated 3 years ago
mbzuai-oryx / MIRA
[ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…
☆23 · Aug 28, 2025 · Updated 10 months ago
chenzcv7 / MOTOR
☆21 · May 4, 2023 · Updated 3 years ago
guanjinquan / CXRTrek
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning. Releases the dataset and the model weights.
☆13 · May 26, 2025 · Updated last year
YH0517 / AFLoc
☆42 · Jan 12, 2026 · Updated 6 months ago
rajpurkarlab / ReXKG
☆17 · Sep 23, 2024 · Updated last year
Tang-xiaoxiao / 3D-RAD
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆32 · Jun 22, 2026 · Updated 2 weeks ago
jdh-algo / Citrus-V
Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
☆24 · Sep 26, 2025 · Updated 9 months ago
MrGiovanni / PanTS
[NeurIPS 2025] PanTS: The Pancreatic Tumor Segmentation Dataset. PanTS is a vision-language dataset, which enables development and extern…
☆115 · Jun 10, 2026 · Updated last month
MediaBrain-SJTU / MedKLIP
The official code for MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training in Radiology. We propose to leverage medical specif…
☆180 · Sep 4, 2023 · Updated 2 years ago
zhjohnchan / M3AE
[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
☆134 · Sep 16, 2022 · Updated 3 years ago
cheliu-computation / M-FLAG-MICCAI2023
☆22 · Aug 1, 2023 · Updated 2 years ago
dek924 / PatientSim
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)
☆38 · Apr 9, 2026 · Updated 3 months ago
X-iZhang / Libra
[ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…
☆30 · Mar 18, 2026 · Updated 3 months ago
LinjieMu / MMXU
☆25 · Nov 27, 2025 · Updated 7 months ago
jbdel / vilmedic
ViLMedic (Vision-and-Language medical research) is a modular framework for vision and language multimodal research in the medical field
☆189 · Oct 9, 2025 · Updated 9 months ago
NVIDIA-Medtech / NV-Reason-CXR
🩻 NV-Reason-CXR-3B is a specialized vision-language model designed for medical reasoning and interpretation of chest X-ray images.
☆59 · Feb 25, 2026 · Updated 4 months ago
med-air / TOP-GPM
☆13 · Aug 7, 2024 · Updated last year
Qybc / MedBLIP
☆57 · Feb 23, 2024 · Updated 2 years ago
wbw520 / DiReCT
DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models (NeurIPS 2024 D&B Track)
☆24 · Mar 6, 2025 · Updated last year
thomaswei-cn / MC-CoT
MC-CoT implementation code
☆23 · Jun 24, 2025 · Updated last year
YueYANG1996 / LaBo
CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
☆108 · May 28, 2024 · Updated 2 years ago
Markin-Wang / XProNet
[ECCV2022] The official implementation of Cross-modal Prototype Driven Network for Radiology Report Generation
☆84 · Dec 27, 2024 · Updated last year
rajpurkarlab / CXR-Report-Metric
☆78 · Apr 23, 2024 · Updated 2 years ago
EIDOSLAB / contrastive-brain-age-prediction
Code for the paper "Contrastive learning for regression in multi-site brain age prediction" | ISBI 2023 https://doi.org/10.1109/ISBI53787…
☆13 · May 5, 2023 · Updated 3 years ago
UCSB-AI / ProbMed
Official repository for the ACL 2025 Findings paper "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal M…
☆26 · May 12, 2026 · Updated 2 months ago
MedHK23 / IMT-CXR
☆20 · Jan 3, 2025 · Updated last year
ritaranx / AceSearcher
This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…
☆25 · Sep 29, 2025 · Updated 9 months ago