yugaljain1999 / Video_Captioning_Pytorch
Video captioning on MSR-VTT Dataset
☆12 Updated 3 years ago
Related projects
Alternatives and complementary repositories for Video_Captioning_Pytorch
- ☆17 Updated 2 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval ☆38 Updated last year
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval ☆26 Updated 3 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing ☆21 Updated last year
- PyTorch Implementation on Paper [CVPR2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning ☆86 Updated 3 years ago
- Source code of Universal Weighting Metric Learning for Cross-Modal Matching. The paper is accepted by CVPR2020. ☆22 Updated 2 years ago
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing ☆24 Updated last year
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning ☆22 Updated 2 years ago
- ☆25 Updated 3 years ago
- Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023) ☆40 Updated last year
- ☆31 Updated 3 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration ☆56 Updated last year
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021). ☆46 Updated 3 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge. ☆17 Updated 3 years ago
- Phrase Localization Evaluation Toolkit ☆19 Updated 5 years ago
- ☆19 Updated last year
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022) ☆45 Updated last year
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions" ☆42 Updated 2 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning" ☆13 Updated 3 years ago
- The codes and features of the re-implementation of SIGIR 2021 work "Deconfounded Video Moment Retrieval with Causal Intervention" ☆35 Updated 3 years ago
- ☆43 Updated 2 years ago
- [ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and … ☆60 Updated 2 years ago
- Cross Modal Retrieval with Querybank Normalisation ☆53 Updated 11 months ago
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral). ☆16 Updated 2 years ago
- Some papers about *diverse* image (a few videos) captioning ☆25 Updated last year
- A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval ☆42 Updated 2 years ago
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning ☆31 Updated last year
- [ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval ☆76 Updated last year
- DeVLBert: Learning Deconfounded Visio-Linguistic Representations ☆27 Updated last year
- A pytorch implementation of data augmentation method for visual question answering ☆21 Updated last year