OpenMICG / CoCoMeD
Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering
☆13 Updated last year
Alternatives and similar repositories for CoCoMeD
Users who are interested in CoCoMeD are comparing it to the repositories listed below:
- [ECCV2024] Nonverbal Interaction Detection ☆27 Updated 8 months ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval ☆35 Updated 2 months ago
- A Video-to-Text Framework ☆10 Updated last year
- Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA ☆11 Updated last year
- [IJCV] AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation ☆20 Updated last year
- ☆29 Updated last year
- Using LLMs and pre-trained caption models for super-human performance on image captioning. ☆42 Updated last year
- ☆23 Updated 3 years ago
- ☆46 Updated last year
- Video Graph Transformer for Video Question Answering (ECCV'22) ☆48 Updated 2 years ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" ☆21 Updated 5 months ago
- ☆19 Updated 2 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i… ☆47 Updated 2 years ago
- Official implementations of our LaZSL (ICCV'25) ☆20 Updated 2 weeks ago
- ☆92 Updated 3 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022 ☆69 Updated 3 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval ☆42 Updated 3 years ago
- This is the pytorch implementation of WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (CVPR2021). ☆12 Updated 2 months ago
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for … ☆59 Updated 2 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral) ☆34 Updated 2 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 Updated last year
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆29 Updated 7 months ago
- ICCV 2021: A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph ge… ☆62 Updated 3 years ago
- ☆14 Updated last year
- [ICCV 2021] Official PyTorch implementation for "D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised… ☆9 Updated 3 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering" ☆66 Updated 3 years ago
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning". ☆19 Updated 2 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning. ☆23 Updated last year
- [ECCV 2022] Official Pytorch Implementation of paper: "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"… ☆18 Updated 2 years ago
- Some papers about *diverse* image (a few videos) captioning ☆26 Updated 2 years ago