dsrestrepo / Foundational-Multimodal-Fusion-BenchmarkLinks
Proposed framework for multimodal data fusion
☆18Updated 6 months ago
Alternatives and similar repositories for Foundational-Multimodal-Fusion-Benchmark
Users that are interested in Foundational-Multimodal-Fusion-Benchmark are comparing it to the libraries listed below
Sorting:
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆46Updated 6 months ago
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Updated 11 months ago
- LLaVa Version of RaDialog☆24Updated 6 months ago
- Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of mo…☆46Updated 8 months ago
- Expert-level AI radiology report evaluator☆35Updated 8 months ago
- ☆51Updated last year
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆60Updated 2 years ago
- ☆19Updated last year
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆46Updated 2 weeks ago
- Official implementation of "EchoTracker: Advancing Myocardial Point Tracking in Echocardiography". (MICCAI 2024)☆50Updated last year
- ☆25Updated last year
- ☆44Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆37Updated 2 months ago
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆32Updated 7 months ago
- ☆31Updated last year
- ☆19Updated 4 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆24Updated 9 months ago
- Official code for the Paper "RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance"☆106Updated 6 months ago
- Medical image captioning using OpenAI's CLIP☆88Updated 2 years ago
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆55Updated last year
- Repo about the MultiCaRe Dataset, with demo notebooks and details about how it was created.☆67Updated last month
- Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost☆38Updated 2 years ago
- ☆29Updated last year
- The repo of the paper: Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medic…☆11Updated 2 years ago
- VQA-Med 2020☆16Updated 2 years ago
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)☆57Updated 2 months ago
- [MICCAI 2024, top 11%] Official Pytorch implementation of Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and …☆75Updated 3 weeks ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆53Updated 2 months ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆23Updated 7 months ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Updated 2 years ago