crux82 / msr-vtt-itLinks
A large scale dataset for Video Captioning in Italian
☆12Updated 2 years ago
Alternatives and similar repositories for msr-vtt-it
Users that are interested in msr-vtt-it are comparing it to the libraries listed below
Sorting:
- ☆43Updated 4 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Updated 4 years ago
- Here we describe a new approach to train a video captioning neural network , that is not only based on the normal cross entropy loss for …☆7Updated 5 years ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- ☆20Updated 3 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆60Updated 2 years ago
- A curated list of the Video Summarization subject which is a computer science using machine learning and deep learning☆42Updated 5 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆45Updated 11 months ago
- ☆22Updated last year
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Updated 4 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 4 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆20Updated 5 years ago
- CoCon: Cooperative Contrastive Learning☆20Updated 2 years ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆14Updated 2 years ago
- Epic Kitchens Object Detector and Feature Extractor using Faster-RCNN with Detectron2☆22Updated 4 years ago
- ☆19Updated 2 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆60Updated 3 years ago
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency☆24Updated 4 years ago
- ☆29Updated last year
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆26Updated 5 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- UniVSE implementation on Python3☆10Updated 4 years ago
- This repo is used for downloading the videos for SVD dataset.☆18Updated 4 years ago
- Implementations of Transformers for Video☆23Updated 4 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 4 years ago
- ☆32Updated 6 years ago
- CLIP-It! Language-Guided Video Summarization☆74Updated 3 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 3 years ago