OpenMICG / mcgLinks
Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA
☆11Updated 2 years ago
Alternatives and similar repositories for mcg
Users that are interested in mcg are comparing it to the libraries listed below
Sorting:
- Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]☆10Updated last year
- ☆12Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆83Updated last year
- ☆46Updated last year
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…