scopeInfinity / Video2Description

Video to Text: Natural language description generator for some given video. [Video Captioning]

☆350

Alternatives and similar repositories for Video2Description

Users who are interested in Video2Description are comparing it to the libraries listed below.

v-iashin / BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆227 Updated 2 years ago
vijayvee / video-captioning
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…
☆166 Updated 5 years ago
facebookresearch / grounded-video-description
Video Grounding and Captioning
☆326 Updated 3 years ago
Shreyz-max / Video-Captioning
Video Captioning is an encoder-decoder model based on sequence-to-sequence learning
☆137 Updated last year
KaiyangZhou / pytorch-vsumm-reinforce
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
☆496 Updated last year
simon-ging / coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆289 Updated 2 years ago
antoine77340 / video_feature_extractor
Easy to use video deep features extractor
☆319 Updated 5 years ago
shruti-jadon / Video-Summarization-using-Keyframe-Extraction-and-Video-Skimming
Experimenting with different Summarizing techniques on SumMe Dataset
☆139 Updated 5 years ago
Kamino666 / Video-Captioning-Transformer
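This is a deep learning model for video captioning, implemented in PyTorch with the Transformer framework. The video captioning task takes a video as input and outputs one sentence describing the content of the whole video (provided the video is short and can be described in a single sentence). The main purpose of this repo is to help the visually impaired…
☆93 Updated 3 years ago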
xiadingZ / video-caption.pytorch
pytorch implementation of video captioning
☆398 Updated 5 years ago
anyirao / SceneSeg
Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
☆229 Updated last year
albanie / collaborative-experts
Video embeddings for retrieval with natural language queries
☆342 Updated 2 years ago
amanwalia123 / KeyFramesExtraction
This repository contains a script to divide a video into key frames.
☆174 Updated 7 years ago
robi56 / video-summarization-resources
Video Summarization Dataset, Papers, Codes
☆169 Updated 6 years ago
movienet / movienet-tools
Tools for movie and video research
☆290 Updated 3 years ago
v-iashin / MDVC
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆143 Updated 2 years ago
JaywongWang / DenseVideoCaptioning
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…
☆150 Updated 6 years ago
li-plus / DSNet
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
☆218 Updated 3 years ago
hobincar / pytorch-video-feature-extractor
A repository for extracting CNN features from videos using PyTorch
☆70 Updated 2 years ago
v-iashin / video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…
☆605 Updated 5 months ago
ok1zjf / VASNet
PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention
☆183 Updated 3 years ago
jnzs1836 / intent-vizor
☆16 Updated last year
ammesatyajit / VideoBERT
Using VideoBERT to tackle video prediction
☆130 Updated 4 years ago
yalesong / tvsum
TVSum: Title-based Video Summarization dataset (CVPR 2015)
☆129 Updated 5 years ago
joelibaceta / video-keyframe-detector
It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.
☆174 Updated last month
pochih / Video-Cap
🎬 Video Captioning: ICCV '15 paper implementation
☆47 Updated 7 years ago
gabeur / mmt
Multi-Modal Transformer for Video Retrieval
☆260 Updated 9 months ago
weirme / FCSN
A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"
☆118 Updated 2 years ago
salesforce / densecap
☆191 Updated last month
oddguan / Audio-Visual-Video-Caption
Pytorch implementation of audio-visual fusion video captioning model
☆27 Updated 6 years ago