crux82 / msr-vtt-itLinks
A large scale dataset for Video Captioning in Italian
☆12Updated 2 years ago
Alternatives and similar repositories for msr-vtt-it
Users that are interested in msr-vtt-it are comparing it to the libraries listed below
Sorting:
- CoCon: Cooperative Contrastive Learning☆20Updated 2 years ago
- ☆24Updated 4 years ago
- Code to perform shot detection and extraction on video☆11Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Updated 4 years ago
- ☆20Updated 3 years ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Updated 2 years ago
- Code for paper <Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation> in ICCV 2021.☆13Updated 3 years ago
- ☆43Updated 4 years ago
- CLIP-It! Language-Guided Video Summarization☆74Updated 4 years ago
- ☆35Updated last year
- ☆29Updated last year
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Updated 4 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 4 years ago
- ☆19Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- The project is about predicting sets (of classes) from images.☆22Updated 3 years ago
- ☆11Updated 4 years ago
- ☆32Updated 6 years ago
- Shapley values for assessing the importance of each frame in a video☆17Updated 4 years ago
- ☆22Updated last year
- ☆22Updated 3 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆20Updated 6 years ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- Feature Re-Learning with Data Augmentation for Video Relevance Prediction☆20Updated 2 years ago
- Use CLIP to represent video for Retrieval Task☆70Updated 4 years ago
- Phrase Localization Evaluation Toolkit☆20Updated 5 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated 2 years ago
- When can you tell whether an image has been cropped or not?☆29Updated 3 years ago
- Implementations of Transformers for Video☆23Updated 4 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆25Updated 3 years ago