dsrestrepo / Foundational-Multimodal-Fusion-BenchmarkLinks
Proposed framework for multimodal data fusion
☆18Updated 7 months ago
Alternatives and similar repositories for Foundational-Multimodal-Fusion-Benchmark
Users that are interested in Foundational-Multimodal-Fusion-Benchmark are comparing it to the libraries listed below
Sorting:
- Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of mo…☆46Updated 9 months ago
- LLaVa Version of RaDialog☆25Updated 7 months ago
- VQA-Med 2020☆16Updated 2 years ago
- Expert-level AI radiology report evaluator☆35Updated 9 months ago
- Bilingual Medical Mixture of Experts LLM☆31Updated last year
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆23Updated 8 months ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Updated 2 years ago
- ☆70Updated 6 months ago
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆45Updated 7 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Updated 10 months ago
- ☆53Updated last year
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆35Updated 8 months ago
- CirrMRI600+: Large Scale MRI Collection and Segmentation of Cirrhotic Liver☆23Updated 8 months ago
- Surgical Visual Question Answering. A transformer-based surgical VQA model. Offical Implementation of "Surgical-VQA: Visual Question Answ…☆61Updated 2 years ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆31Updated last month
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆107Updated 2 months ago
- [EMNLP 2025] Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆55Updated 4 months ago
- ☆19Updated last year
- The code for paper: PeFoMed: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆57Updated 3 weeks ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆28Updated 7 months ago
- ☆21Updated 2 years ago
- A new collection of medical VQA dataset based on MIMIC-CXR. Part of the work 'EHRXQA: A Multi-Modal Question Answering Dataset for Electr…☆94Updated last year
- ☆32Updated last year
- [ISBI 2025] XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational…☆15Updated 6 months ago
- ☆36Updated last month
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Updated last year
- ☆43Updated last year
- [Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation☆210Updated last year
- ☆25Updated 2 years ago
- This repository is made for the paper: Self-supervised vision-language pretraining for Medical visual question answering☆42Updated 2 years ago