Implementation for "Joint Event Detection and Description in Continuous Video Streams"
☆23Nov 4, 2020Updated 5 years ago
Alternatives and similar repositories for JEDDi-Net
Users who are interested in JEDDi-Net are comparing it to the repositories listed below
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- Event Sequence Generation Network☆14Jun 22, 2021Updated 4 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Aug 25, 2021Updated 4 years ago
- Egocentric Video Description based on Temporally-Linked Sequences☆11Jul 17, 2017Updated 8 years ago
- Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"☆49Jan 21, 2019Updated 7 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…☆104Mar 21, 2020Updated 5 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- Implementation of "Encouraging LSTMs to Anticipate Actions Very Early", ICCV 2017☆19Mar 25, 2018Updated 7 years ago
- Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.☆16Jul 28, 2023Updated 2 years ago
- Phrase Localization Evaluation Toolkit☆20Aug 16, 2019Updated 6 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- [ICIP 2019] Implementation of Saliency Tubes for 3D Convolutions in PyTorch and Keras to localise the focus spatio-temporal regions of 3D …☆54Apr 2, 2020Updated 5 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆24Jul 12, 2019Updated 6 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- The official repository for DreamSampler (ECCV24)☆37Oct 11, 2024Updated last year
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- with reinforcement learning☆31May 19, 2020Updated 5 years ago
- ☆27Nov 26, 2018Updated 7 years ago
- A simple script for downloading videos of the ActivityNet dataset by parsing URLs from a given .json file.☆21Nov 30, 2017Updated 8 years ago
- A list of papers on temporal action detection using deep learning.☆56Sep 25, 2019Updated 6 years ago
- Generating Video Description using Sequence-to-sequence Model with Temporal Attention☆33Mar 19, 2019Updated 6 years ago
- Video Summarization (Attention Mechanism and Hierarchical LSTM)☆31Feb 14, 2018Updated 8 years ago
- New word discovery / new word mining / degree of freedom / degree of cohesion / python3☆10May 28, 2019Updated 6 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
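- Jcseg is a lightweight Chinese word segmenter based on the mmseg algorithm. It also integrates keyword extraction, key phrase extraction, key sentence extraction, and automatic article summarization, provides a Jetty-based web server so that other languages can call it directly over HTTP, and provides word segmentation for the latest versions of lucene, solr, and elasticsearch…☆11Jan 22, 2017Updated 9 years ago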
- ☆30Dec 16, 2018Updated 7 years ago
- ☆191Jun 16, 2025Updated 8 months ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 5 years ago
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆27Mar 19, 2020Updated 5 years ago
- Code for the paper☆29May 30, 2019Updated 6 years ago
- Video captioning implementation based on OpenNMT☆36Apr 19, 2018Updated 7 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- temporal action detection: benchmark results, features download etc.☆203Jan 10, 2021Updated 5 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆144Apr 8, 2023Updated 2 years ago
- AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos. ECCV'18.☆78Jun 21, 2022Updated 3 years ago