MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.
☆24Jul 12, 2019Updated 6 years ago
Alternatives and similar repositories for VideoToTextDNN
Users that are interested in VideoToTextDNN are comparing it to the libraries listed below
Sorting:
- ☆15Aug 20, 2024Updated last year
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 4 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- PyTorch implementation of video captioning☆13Sep 24, 2017Updated 8 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- ☆22Feb 25, 2021Updated 5 years ago
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 4 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"☆16Dec 31, 2017Updated 8 years ago
- implementation of TDConvED for video captioning☆13Mar 18, 2020Updated 6 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- ☆11Sep 15, 2017Updated 8 years ago
- ☆62May 11, 2021Updated 4 years ago
- Extracts the shot classes and generic visual features for a broadcast news video.☆13Jul 23, 2017Updated 8 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 5 years ago
- PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)☆16Sep 17, 2019Updated 6 years ago
- A presentation on Augmented CycleGAN and the papers that lead up to it☆11Dec 3, 2018Updated 7 years ago
- Dense video captioning in PyTorch☆41Aug 30, 2019Updated 6 years ago
- automatic video description generation with GPU training☆256Jan 12, 2020Updated 6 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Aims at attributing the big-five personality traits to authors of essays by analyzing their works.☆17Dec 19, 2018Updated 7 years ago
- ☆23Jul 20, 2017Updated 8 years ago
- Shot threading and scene detection in TV series☆23Dec 8, 2016Updated 9 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- Video captioning baseline models on Video2Commonsense Dataset.☆56Apr 15, 2021Updated 4 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆12Aug 28, 2020Updated 5 years ago
- Various implementations and experimentation for deep neural network model compression☆24Sep 6, 2018Updated 7 years ago
- Video Summarization (Attention Mechanism and Hierarchical LSTM)☆31Feb 14, 2018Updated 8 years ago
- Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds☆20Dec 18, 2021Updated 4 years ago
- Implementation and improvement of paper 'Learning Multiple Views with Orthogonal Denoising Autoencoders'☆16Jul 18, 2024Updated last year
- Jcseg是基于mmseg算法的一个轻量级中文分词器,同时集成了关键字提取,关键短语提取,关键句子提取和文章自动摘要等功能,并且提供了一个基于Jetty的web服务器,方便各大语言直接http调用,同时提供了最新版本的lucene,solr和elasticsearch的分词…☆11Jan 22, 2017Updated 9 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)☆34Jul 17, 2019Updated 6 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- Code and pre-trained models for my submission to the ChaLearn 2021 LAP challenge.☆18Sep 5, 2022Updated 3 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15May 6, 2021Updated 4 years ago
- multi-scale multi-object star model Matlab code for single shot object recognition☆27Mar 4, 2018Updated 8 years ago