crux82 / msr-vtt-it
A large scale dataset for Video Captioning in Italian
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for msr-vtt-it
- ☆41Updated 3 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Updated 3 years ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆60Updated last year
- Here we describe a new approach to train a video captioning neural network , that is not only based on the normal cross entropy loss for …☆8Updated 4 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆26Updated 5 years ago
- ☆31Updated 6 years ago
- ☆24Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Updated 3 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆57Updated 3 years ago
- ☆19Updated last year
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning.☆38Updated 2 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆21Updated 5 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆29Updated 3 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- Video captioning baseline models on Video2Commonsense Dataset.☆57Updated 3 years ago
- Improved evaluation codes for common visual captioning metrics.☆11Updated 2 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated last year
- Deep Learning for Video Retrieval by Natural Language☆11Updated 5 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Updated 4 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 4 months ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆68Updated 2 years ago
- ☆11Updated 4 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- implementation of TDConvED for video captioning☆13Updated 4 years ago
- CLIP-It! Language-Guided Video Summarization☆73Updated 3 years ago
- ☆35Updated last year
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆40Updated last year
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)☆17Updated last year