junjie-shentu / CXR-IRGenLinks
Implementation of the paper "CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Image-Report Pairs" (WACV 2024)
☆20Updated last year
Alternatives and similar repositories for CXR-IRGen
Users that are interested in CXR-IRGen are comparing it to the libraries listed below
Sorting:
- ☆25Updated last year
- [CVPR2024] PairAug: What Can Augmented Image-Text Pairs Do for Radiology?☆30Updated last year
- [CHIL 2024] ViewXGen: Vision-Language Generative Model for View-Specific Chest X-ray Generation☆55Updated last year
- ☆22Updated 2 years ago
- Multi-Aspect Vision Language Pretraining - CVPR2024☆87Updated last year
- Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation☆17Updated last year
- ☆70Updated 6 months ago
- ☆21Updated 8 months ago
- Localized representation learning from Vision and Text (LoVT)☆31Updated last year
- ☆31Updated last year
- [CVPR 2024] FairCLIP: Harnessing Fairness in Vision-Language Learning☆98Updated 6 months ago
- 【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding☆29Updated last week
- [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.☆128Updated 3 years ago
- [ICCV-2023] Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts☆77Updated last year
- Official code for "LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation"☆143Updated 2 years ago
- The official GitHub repository of the AAAI-2024 paper "Bootstrapping Large Language Models for Radiology Report Generation".☆65Updated last year
- Official Code for Contrastive Learning with Counterfactual Explanations for Radiology Report Generation (ECCV 2024)☆16Updated 9 months ago
- [ECCV'2024] HERGen: Elevating Radiology Report Generation with Longitudinal Data☆29Updated last week
- The code for paper: PeFoMed: Parameter Efficient Fine-tuning on Multi-modal Large Language Models for Medical Visual Question Answering☆59Updated last month
- ☆21Updated last year
- ☆118Updated last year
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Updated last year
- ☆20Updated 2 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆60Updated 6 months ago
- BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays☆45Updated last month
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Updated 3 months ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆77Updated last year
- [ICML'25] MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization☆67Updated 7 months ago
- The official start-up code for paper "FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark."☆65Updated last year
- [ECCV2022] The official implementation of Cross-modal Prototype Driven Network for Radiology Report Generation☆80Updated last year