WillDreamer/Awesome-MLLM-Reasoning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WillDreamer/Awesome-MLLM-Reasoning)

WillDreamer / Awesome-MLLM-Reasoning

Recent Advances on MLLM's Reasoning Ability

☆26

Alternatives and similar repositories for Awesome-MLLM-Reasoning

Users that are interested in Awesome-MLLM-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ASGMVLP / ASGMVLP_CODE
View on GitHub
The repo of ASGMVLP
☆19Jan 16, 2026Updated 6 months ago
hlk-1135 / RadGraph
View on GitHub
RadGraph: Extracting Clinical Entities and Relations from Radiology Reports
☆14Nov 22, 2022Updated 3 years ago
guanjinquan / CXRTrek
View on GitHub
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight
☆13May 26, 2025Updated last year
ahmdtaha / distributed_sigmoid_loss
View on GitHub
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
☆11Sep 26, 2023Updated 2 years ago
SZUHvern / MaCo
View on GitHub
The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…
☆12Sep 13, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gentlefress / MLIP
View on GitHub
The code of paper "MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning" accep…
☆10Mar 5, 2024Updated 2 years ago
Tang-xiaoxiao / Medthink
View on GitHub
[ 🎯 NAACL 2025 ] MedThink: A Rationale-Guided Framework for Explaining Medical Visual Question Answering
☆18Jun 15, 2026Updated last month
mbzuai-oryx / MIRA
View on GitHub
[ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…
☆23Aug 28, 2025Updated 11 months ago
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 8 months ago
SUSTechBruce / Med-UniC
View on GitHub
official implementation of "Med-Unic: unifying cross-lingual medical vision-language pre-training by diminishing bias"
☆18Sep 22, 2023Updated 2 years ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated last year
uni-medical / GMAI-VL-R1
View on GitHub
☆19Jul 21, 2025Updated last year
MME-Benchmarks / MME-CoT
View on GitHub
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆136Aug 5, 2025Updated 11 months ago
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
sunxm2357 / DIME-FM
View on GitHub
Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"
☆15Oct 12, 2023Updated 2 years ago
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
PerceptionComputingLab / MedFILIP
View on GitHub
[IEEE-JBHI 2025] Pytorch implementation of the paper "MedFILIP: Medical Fine-Grained Language-Image Pre-Training s"
☆26Jan 18, 2025Updated last year
HKU-MedAI / HERGen
View on GitHub
[ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data
☆31Jan 25, 2026Updated 6 months ago
rajpurkarlab / ReXKG
View on GitHub
☆17Sep 23, 2024Updated last year
Richar-Du / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated last year
Becomebright / GroundVQA
View on GitHub
Official PyTorch code of GroundVQA (CVPR'24)
☆63Sep 13, 2024Updated last year
AI-in-Health / ClinicBench
View on GitHub
[EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"
☆36May 2, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
baopj / Vid-Morp
View on GitHub
☆12Dec 6, 2024Updated last year
Code-kunkun / ZS-CIR
View on GitHub
[BMVC 2023] Zero-shot Composed Text-Image Retrieval
☆55Nov 26, 2024Updated last year
Code-kunkun / LamRA
View on GitHub
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆182Jul 7, 2025Updated last year
RL4M / MRM-pytorch
View on GitHub
An official implementation of Advancing Radiograph Representation Learning with Masked Record Modeling (ICLR'23)
☆77Feb 21, 2023Updated 3 years ago
MAGIC-AI4Med / ChestX-Reasoner
View on GitHub
☆39Mar 19, 2026Updated 4 months ago
NJU-LINK / MT-Video-Bench
View on GitHub
The Source Code for MT-Video-Bench @ ACL Findings 2026
☆22Jan 20, 2026Updated 6 months ago
WillDreamer / Aurora
View on GitHub
[NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
☆90Nov 28, 2023Updated 2 years ago
cwangrun / CheXficient
View on GitHub
CheXficient
☆15Jun 28, 2026Updated last month
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
PlusLabNLP / VISCO
View on GitHub
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆13Jun 7, 2025Updated last year
wjhou / ICon
View on GitHub
[EMNLP 2024 Findings] ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation
☆19Dec 11, 2024Updated last year
Liqq1 / AOR
View on GitHub
AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation
☆52Jan 20, 2026Updated 6 months ago
ZrH42 / UniX
View on GitHub
☆31Mar 29, 2026Updated 4 months ago
StanfordMIMI / MedVAL
View on GitHub
Toward Expert-Level Medical Text Validation with Language Models
☆18Oct 23, 2025Updated 9 months ago
HKUSTGZ-ML4Health-Lab / Med-Scout
View on GitHub
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
☆16Feb 8, 2026Updated 5 months ago