MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.
☆24Jul 12, 2019Updated 6 years ago
Alternatives and similar repositories for VideoToTextDNN
Users that are interested in VideoToTextDNN are comparing it to the libraries listed below
Sorting:
- ☆15Aug 20, 2024Updated last year
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 4 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 3 years ago
- video captioning☆24Mar 14, 2019Updated 6 years ago
- A large scale dataset for Video Captioning in Italian☆13May 16, 2023Updated 2 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"☆16Dec 31, 2017Updated 8 years ago
- ☆11Sep 15, 2017Updated 8 years ago
- PyTorch implementation of video captioning☆13Sep 24, 2017Updated 8 years ago
- A presentation on Augmented CycleGAN and the papers that lead up to it☆11Dec 3, 2018Updated 7 years ago
- Extracts the shot classes and generic visual features for a broadcast news video.☆13Jul 23, 2017Updated 8 years ago
- implementation of TDConvED for video captioning☆13Mar 18, 2020Updated 5 years ago
- Cost-Effective Object Detection: Active Sample Mining with Switchable Selection Criteria☆12Dec 1, 2018Updated 7 years ago
- ☆16Jun 18, 2025Updated 8 months ago
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 4 years ago
- ☆15Aug 16, 2019Updated 6 years ago
- ☆22Feb 25, 2021Updated 5 years ago
- Implementation and improvement of paper 'Learning Multiple Views with Orthogonal Denoising Autoencoders'☆16Jul 18, 2024Updated last year
- ☆15Aug 22, 2019Updated 6 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- Python snippets☆21Mar 10, 2020Updated 5 years ago
- GCNet (GIF Caption Network) | Neural Network Generated GIF Captions☆15Nov 29, 2016Updated 9 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)☆16Sep 17, 2019Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Sep 6, 2018Updated 7 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Nov 4, 2020Updated 5 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- fashionAI clothes keypoint detection☆21Jun 5, 2018Updated 7 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- ☆55May 14, 2020Updated 5 years ago
- multi-scale multi-object star model Matlab code for single shot object recognition☆27Mar 4, 2018Updated 7 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Video captioning baseline models on Video2Commonsense Dataset.☆57Apr 15, 2021Updated 4 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Nov 23, 2018Updated 7 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Generating Video Description using Sequence-to-sequence Model with Temporal Attention☆33Mar 19, 2019Updated 6 years ago
- automatic video description generation with GPU training☆256Jan 12, 2020Updated 6 years ago
- An easy-to-use tool to extract frames from video and store into database.☆32Jan 4, 2019Updated 7 years ago
- Video Summarization (Attention Mechanism and Hierarchical LSTM)☆31Feb 14, 2018Updated 8 years ago