theopsall / Video-Summarization

Multimodal summarization of user-generated videos from wearable cameras

☆23

Alternatives and similar repositories for Video-Summarization

Users who are interested in Video-Summarization are comparing it to the repositories listed below.

e-apostolidis / PGL-SUM
A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…
☆91 Updated 2 years ago
jylins / videoxum
[TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos
☆53 Updated last year
v-iashin / BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆230 Updated 2 years ago
ihababdelkareem / video-summarization
Key-frame based summarization of videos
☆29 Updated 3 years ago
nchucvml / STVT
Video Summarization With Spatiotemporal Vision Transformer
☆22 Updated 2 years ago
yashkolli / Video-Summarization-Using-Attention
A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks
☆30 Updated 3 years ago
v-iashin / MDVC
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆144 Updated 2 years ago
Shreyz-max / Video-Captioning
Video Captioning is an encoder-decoder model based on sequence-to-sequence learning
☆138 Updated last year
joelibaceta / video-keyframe-detector
A simple Python tool to extract key-frames from a video file using peak estimation from frame differences.
☆197 Updated 6 months ago
zhaoyang9425 / SER
Unimodal/Multimodal Sentiment Analysis, Emotion Recognition
☆10 Updated 3 years ago
keplerlab / katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
☆385 Updated last year
ammesatyajit / VideoBERT
Using VideoBERT to tackle video prediction
☆133 Updated 4 years ago
HopLee6 / SSPVS-PyTorch
PyTorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆35 Updated 3 months ago
jnzs1836 / intent-vizor
☆16 Updated last year
simon-ging / coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291 Updated 3 years ago
affect2mm / emotion-timeseries
☆16 Updated 5 years ago
TIBHannover / UnsupervisedVideoSummarization
Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021
☆21 Updated 3 years ago
WasifurRahman / BERT_multimodal_transformer
☆212 Updated 4 years ago
katha-ai / EmoTx-CVPR2023
[CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…
☆58 Updated last year
TIBHannover / MSVA
Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)
☆47 Updated last year
ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆12 Updated 2 years ago
ppapalampidi / GraphTP
Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"
☆30 Updated 4 years ago
akashe / Multimodal-action-recognition
Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.
☆73 Updated 4 years ago
skeletonNN / CFN-SR
☆27 Updated 4 years ago
tanishqgautam / Image-Captioning
Implemented 3 different architectures to tackle the Image Captioning problem, i.e., Merged Encoder-Decoder - Bahdanau Attention - Transformer…
☆40 Updated 4 years ago
e-apostolidis / CA-SUM
A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …
☆31 Updated 3 years ago
Tandon-A / emotic
PyTorch implementation of Emotic CNN methodology to recognize emotions in images using context information.
☆146 Updated last year
Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14 Updated 2 years ago
phaphuang / DSR-RL
PyTorch implementation of DSR-RL for the video summarization task
☆12 Updated 4 years ago
v-iashin / video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…
☆638 Updated 10 months ago