yugaljain1999 / Video_Captioning_PytorchLinks
Video captioning on MSR-VTT Dataset
☆12Updated 4 years ago
Alternatives and similar repositories for Video_Captioning_Pytorch
Users that are interested in Video_Captioning_Pytorch are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 years ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Updated 2 years ago
- ☆19Updated 3 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 4 years ago
- Official Implementation of CoSMo: Content-Style Modulation for Image Retrieval with Text Feedback presented in CVPR 2021.☆66Updated 3 years ago
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).☆17Updated 3 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Updated 4 years ago
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Updated last year
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆32Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Updated 3 years ago
- The code for the paper "Contrastive Quantization with Code Memory for Unsupervised Image Retrieval" (AAAI'22, Oral).☆38Updated 3 years ago
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…☆18Updated 4 years ago
- [ICCV 2021] Released code for Causal Attention for Unbiased Visual Recognition☆80Updated 2 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆141Updated 3 years ago
- ☆19Updated 8 months ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆54Updated 2 years ago
- ☆31Updated 4 years ago
- Cross Modal Retrieval with Querybank Normalisation☆57Updated 2 years ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆24Updated 3 years ago
- ☆47Updated this week
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Updated last year
- ☆38Updated 3 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆268Updated last year
- implementation of paper https://arxiv.org/abs/2210.04559☆56Updated last month
- super image for action recognition☆56Updated 3 years ago
- MixGen: A New Multi-Modal Data Augmentation☆126Updated 3 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval." CVPR 2022☆115Updated 3 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆95Updated 3 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Updated 2 years ago