OSUPCVLab/VideoToTextDNN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OSUPCVLab/VideoToTextDNN)

OSUPCVLab / VideoToTextDNN

MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.

☆24

Alternatives and similar repositories for VideoToTextDNN

Users that are interested in VideoToTextDNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maurya-rohit / Scene-Graph-For-Videos
View on GitHub
☆15Aug 20, 2024Updated last year
crux82 / msr-vtt-it
View on GitHub
A large scale dataset for Video Captioning in Italian
☆13May 16, 2023Updated 3 years ago
DWCTOD / arXiv-CVPR2022-daily
View on GitHub
CVPR2022 update everyday!
☆11Apr 12, 2022Updated 4 years ago
Peratham / video2text.pytorch
View on GitHub
PyTorch implementation of video captioning
☆13Sep 24, 2017Updated 8 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Cadene / torchnet-m2caiworkflow
View on GitHub
Finalist entry for the M2CAI Workflow Challenge 2016
☆10Nov 25, 2016Updated 9 years ago
jamespark3922 / lsmdc-baseline
View on GitHub
☆15Aug 16, 2019Updated 6 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
View on GitHub
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Jun 1, 2021Updated 5 years ago
roeeaharoni / string-to-tree-nmt
View on GitHub
Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
☆16Dec 31, 2017Updated 8 years ago
b05902062 / TDConvED
View on GitHub
implementation of TDConvED for video captioning
☆13Mar 18, 2020Updated 6 years ago
szq0214 / MSR-VTT-Challenge
View on GitHub
Video to Language Challenge (MSR-VTT Challenge 2016)
☆32Dec 28, 2017Updated 8 years ago
Curious-Geek / Video-Captioning
View on GitHub
Study of frame rate effects on MSR-VTT dataset
☆14Feb 10, 2018Updated 8 years ago
feichtenhofer / temporal-resnet
View on GitHub
☆11Sep 15, 2017Updated 8 years ago
gshruti95 / news-shot-classification
View on GitHub
Extracts the shot classes and generic visual features for a broadcast news video.
☆13Jul 23, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fidler-lab / Caption-Lifetime-by-Asking-Questions
View on GitHub
PyTorch code for Learning to Caption Images through a Lifetime by Asking Questions (ICCV 2019)
☆16Sep 17, 2019Updated 6 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
NathanDeMaria / AugmentedCycleGAN
View on GitHub
A presentation on Augmented CycleGAN and the papers that lead up to it
☆11Dec 3, 2018Updated 7 years ago
juditacs / snippets
View on GitHub
Python snippets
☆21Mar 10, 2020Updated 6 years ago
LuoweiZhou / densecap
View on GitHub
Dense video captioning in PyTorch
☆41Aug 30, 2019Updated 6 years ago
VisionLearningGroup / JEDDi-Net
View on GitHub
Implementation for "Joint Event Detection and Description in Continuous Video Streams"
☆23Nov 4, 2020Updated 5 years ago
sammy-su / Pano2Vid
View on GitHub
☆23Jul 20, 2017Updated 9 years ago
li-xirong / video-retrieval
View on GitHub
Deep Learning for Video Retrieval by Natural Language
☆11Oct 20, 2019Updated 6 years ago
hobincar / SGN
View on GitHub
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Jul 9, 2021Updated 5 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
ZhecanJamesWang / GLAT_SGG
View on GitHub
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11Dec 16, 2020Updated 5 years ago
xiadingZ / video-caption-openNMT.pytorch
View on GitHub
implement video caption based on openNMT
☆36Apr 19, 2018Updated 8 years ago
Ayonksh / Video-Object-Removal
View on GitHub
只要给物体画上一个方框，就可以在视频中去除这个物体并修复视频
☆11Apr 5, 2022Updated 4 years ago
yuriytkach / fundraiser-tracker
View on GitHub
Fundraiser Tracker implemented as AWS Lambda with ability to manage through Slack and autosync with Monobank and Privatbank
☆10Apr 24, 2025Updated last year
yrcong / NODIS
View on GitHub
Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020
☆12Aug 28, 2020Updated 5 years ago
ChristopherSweeney / SlimNets
View on GitHub
Various implementations and experimentation for deep neural network model compression
☆24Sep 6, 2018Updated 7 years ago
aistairc / seq2seq_temporal_attention
View on GitHub
Generating Video Description using Sequence-to-sequence Model with Temporal Attention
☆33Mar 19, 2019Updated 7 years ago
tengerye / orthogonal-denoising-autoencoder
View on GitHub
Implementation and improvement of paper 'Learning Multiple Views with Orthogonal Denoising Autoencoders'
☆16Jul 18, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tuyunbin / Video-Description-with-Spatial-Temporal-Attention
View on GitHub
[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
☆61Oct 20, 2020Updated 5 years ago
cnsuhao / jcseg
View on GitHub
Jcseg是基于mmseg算法的一个轻量级中文分词器，同时集成了关键字提取，关键短语提取，关键句子提取和文章自动摘要等功能，并且提供了一个基于Jetty的web服务器，方便各大语言直接http调用，同时提供了最新版本的lucene，solr和elasticsearch的分词…
☆11Jan 22, 2017Updated 9 years ago
amazon-science / gluonmm
View on GitHub
A library of transformer models for computer vision and multi-modality research
☆49Sep 7, 2021Updated 4 years ago
lenML / gcp-claude-openai-api-server
View on GitHub
converts Vertex AI API to OpenAI API format.
☆12Oct 23, 2024Updated last year
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
v-iashin / MDVC
View on GitHub
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆144Apr 8, 2023Updated 3 years ago
AmingWu / CCN
View on GitHub
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15May 6, 2021Updated 5 years ago