OpenMICG / AHP
Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation
☆12Updated 6 months ago
Alternatives and similar repositories for AHP
Users who are interested in AHP are comparing it to the repositories listed below.
- A consistent Med-VQA dataset, C-SLAKE, extended by Slake for further consistency assessment.☆13Updated last year
- Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA☆11Updated last year
- Observation Driven Memory Synergistic Planning for Continuous Vision-Language Navigation☆11Updated last year
- Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering☆13Updated last year
- SotA text-only image/video method (IJCAI 2023)☆16Updated last year
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Updated 2 years ago
- Local self-attention in Transformer for visual question answering☆12Updated last year
- Code for SAM-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning☆15Updated last year
- PyTorch implementation of the paper "Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning".☆9Updated 2 years ago
- A curated publication list on visual dialog☆14Updated 2 years ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆64Updated last year
- Papers and Public Datasets for Medical Vision-Language Learning☆17Updated 2 years ago
- This repo is the official implementation of the paper titled "Automatic Radiology Report Generation by Learning with Increasingly Hard Ne…☆8Updated 7 months ago
- The official implementation of “Cross-Modal Causal Representation Learning for Radiology Report Generation” (IEEE T-IP 2025)☆53Updated 2 months ago
- Biomedical Image Captioning☆59Updated 2 years ago
- The code for our ACL-2022 paper titled "Reinforced Cross-modal Alignment for Radiology Report Generation"☆23Updated 2 years ago
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025☆13Updated 2 weeks ago
- Progressive Transformer-Based Generation of Radiology Reports☆25Updated 6 months ago
- Improving Chest X-Ray Report Generation by Leveraging Warm-Starting☆70Updated last year
- [TPAMI 2024] This is the official PyTorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆18Updated 2 months ago
- On the Importance of Image Encoding in Automated Chest X-Ray Report Generation, BMVC 2022☆16Updated 2 years ago
- Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025)☆23Updated this week
- [ACL-2021] The official implementation of Cross-modal Memory Networks for Radiology Report Generation.☆101Updated last year
- [EMNLP-2020] The official implementation of Generating Radiology Reports via Memory-driven Transformer.☆110Updated last year
- The code of paper "MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning" accep…☆9Updated last year