[ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging
☆39Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for Med-MAT
Users that are interested in Med-MAT are comparing it to the libraries listed below
Sorting:
- This repository is the official implementation of the TrafficGamer.☆31Nov 22, 2024Updated last year
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆52Nov 20, 2024Updated last year
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 4 months ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆30Jul 9, 2025Updated 8 months ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 7 months ago
- ☆22Nov 27, 2025Updated 3 months ago
- [NAACL 2025] VividMed: Vision Language Model with Versatile Visual Grounding for Medicine☆28Mar 10, 2025Updated last year
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated last year
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex…☆20Aug 28, 2025Updated 6 months ago
- Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆38Sep 28, 2025Updated 5 months ago
- The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?"☆28Jan 22, 2025Updated last year
- ☆41Jan 28, 2026Updated last month
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆85Jun 4, 2025Updated 9 months ago
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆122Jan 9, 2025Updated last year
- ☆16Jan 28, 2026Updated last month
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- ☆25Jan 11, 2025Updated last year
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆45Oct 18, 2025Updated 5 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 10 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- An interpretable large language model (LLM) for medical diagnosis.☆160Sep 12, 2024Updated last year
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…☆13Sep 13, 2024Updated last year
- Chest X-Ray Explainer (ChEX)☆23Jan 30, 2025Updated last year
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆87Jan 4, 2026Updated 2 months ago
- ECCV[2024] "Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model" official implement☆16Jul 15, 2025Updated 8 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆112Oct 28, 2025Updated 4 months ago
- ☆20Jan 3, 2025Updated last year
- [ISBI 2025] Design Data Before Models: Using large vision-language models to automatically enhance medical dataset annotations.☆35Jan 28, 2026Updated last month
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.☆82Dec 17, 2024Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- ☆11Jun 21, 2025Updated 8 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆77Jan 26, 2026Updated last month
- Official implementation of "MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation"☆26Feb 22, 2026Updated 3 weeks ago
- CVPR2026☆25Sep 18, 2025Updated 6 months ago
- A nnU-Netv2 based acceleration solution for Abdominal organs and tumor segmentation.☆20Nov 24, 2023Updated 2 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆23Sep 19, 2024Updated last year
- [Sci. Rep. 2025] Revisiting model scaling with a U-net benchmark for 3D medical image segmentation☆18Aug 21, 2025Updated 6 months ago
- ☆69Feb 3, 2025Updated last year
- HealthFlow: A Self-Evolving AI Agent with Meta Planning for Autonomous Healthcare Research☆37Nov 26, 2025Updated 3 months ago