zhaoshitian / Causal-CoGLinks
[CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"
☆15Updated last year
Alternatives and similar repositories for Causal-CoG
Users that are interested in Causal-CoG are comparing it to the libraries listed below
Sorting:
- OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding☆56Updated 2 months ago
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆89Updated 2 months ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆148Updated last year
- The code for paper: PeFoM-Med: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆55Updated 3 months ago
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆54Updated 3 months ago
- [ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks☆41Updated 2 months ago
- The official repository of the paper 'Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine'☆93Updated 8 months ago
- AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation☆41Updated 4 months ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆29Updated 10 months ago
- [MICCAI 2024] Can LLMs' Tuning Methods Work in Medical Multimodal Domain?☆17Updated last year
- ☆40Updated 10 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 7 months ago
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆55Updated 6 months ago
- [CVPR 2024]Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation☆26Updated 11 months ago
- [CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering☆22Updated 5 months ago
- The collection of medical VLP papars☆19Updated last year
- [EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models☆91Updated 9 months ago
- The official GitHub repository of the survey paper "A Systematic Review of Deep Learning-based Research on Radiology Report Generation".☆91Updated 4 months ago
- ☆19Updated 4 months ago
- ☆50Updated 4 months ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆24Updated 2 months ago
- A framework for Longitudinal Radiology Report Generation☆18Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆31Updated 6 months ago
- MedEvalKit: A Unified Medical Evaluation Framework☆146Updated 3 weeks ago
- [CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?☆30Updated 10 months ago
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆82Updated 3 months ago
- ☆20Updated last week
- ☆68Updated 2 months ago
- Code for the paper "ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning" (ACL'23).☆55Updated 11 months ago
- [NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models☆46Updated last year