KoDohwan / VT-TWINS
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)
☆10Updated last year
Related projects:
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆26Updated 2 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Updated 3 years ago
- CVPR’2022 Kinetics-GEBD Challenge☆10Updated 2 years ago
- PyTorch implementation of "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- Video captioning on MSR-VTT Dataset☆12Updated 3 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆36Updated last year
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆46Updated 3 years ago
- Source code for RaNet (EMNLP 2021)☆30Updated 2 years ago
- Modality-Agnostic Attention Fusion for visual search with text feedback☆25Updated last year
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 3 years ago
- [CVPR 2021] PyTorch implementation of Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation☆17Updated 3 years ago
- Source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (NeurIPS 2023)☆19Updated 2 months ago
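- A collection of video-text retrieval work from the major computer vision conferences of recent years, kept in sync with my blog: https://blog.csdn.net/AAliuxiaolei/article/details/121433833 ☆14Updated 2 years ago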
- [CVPR 2022] Code for the paper "Object-aware Video-language Pre-training for Retrieval"☆61Updated 2 years ago
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆14Updated 9 months ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆34Updated 2 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆38Updated 9 months ago
- [ECCV 2022] Official PyTorch implementation of the paper "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"…☆18Updated last year
- [ECCV'22 Poster] Explicit Image Caption Editing☆21Updated last year
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago