VisionLearningGroup / JEDDi-Net
Implementation for "Joint Event Detection and Description in Continuous Video Streams"
☆23Updated 4 years ago
Alternatives and similar repositories for JEDDi-Net:
Users who are interested in JEDDi-Net compare it to the repositories listed below:
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Updated 5 years ago
- video captioning☆24Updated 6 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆67Updated 5 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…☆60Updated 2 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Updated 5 years ago
- Code for the paper: "Sentence Specified Dynamic Video Thumbnail Generation"☆33Updated 5 years ago
- Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016☆32Updated 6 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Updated 5 years ago
- ☆30Updated 6 years ago
- AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos. ECCV'18.☆75Updated 2 years ago
- Evaluation code for Dense-Captioning Events in Videos☆126Updated 5 years ago
- ☆32Updated 6 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Updated 4 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆154Updated 2 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Updated 6 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Updated 6 years ago
- Dense video captioning in PyTorch☆41Updated 5 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆16Updated 5 years ago
- Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"☆50Updated 6 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019).☆55Updated 9 months ago
- Code for our ICML 2019 paper "Temporal Gaussian Mixture Layer for Videos"☆101Updated 5 years ago
- Code for "Video Re-localization" in ECCV 2018☆80Updated 5 years ago
- Here we describe a new approach to train a video captioning neural network, that is not only based on the normal cross entropy loss for …☆7Updated 5 years ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Updated 2 years ago
- ☆33Updated 6 years ago
- Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"☆44Updated 9 months ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Updated 2 years ago
- A Pytorch implementation of some state-of-the-art models for "Temporally Language Grounding in Untrimmed Videos"☆96Updated 5 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆33Updated 5 years ago
- ☆42Updated 4 years ago