theopsall / Video-Summarization
Multimodal summarization of user-generated videos from wearable cameras
☆23 Updated 5 months ago
Alternatives and similar repositories for Video-Summarization
Users who are interested in Video-Summarization are comparing it to the libraries listed below
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS… ☆91 Updated 2 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos ☆53 Updated last year
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020) ☆230 Updated 2 years ago
- Key-frame based summarization of videos ☆29 Updated 3 years ago
- Video Summarization With Spatiotemporal Vision Transformer ☆22 Updated 2 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks ☆30 Updated 3 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops) ☆144 Updated 2 years ago
- Video Captioning is an encoder-decoder model based on sequence-to-sequence learning ☆138 Updated last year
- A simple Python tool to extract key-frames from a video file using peak estimation from frame differences. ☆197 Updated 6 months ago
- Unimodal/Multimodal Sentiment Analysis, Emotion Recognition ☆10 Updated 3 years ago
- Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks ☆385 Updated last year
- Using VideoBERT to tackle video prediction ☆133 Updated 4 years ago
- PyTorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning" ☆35 Updated 3 months ago
- ☆16 Updated last year
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆291 Updated 3 years ago
- ☆16 Updated 5 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021 ☆21 Updated 3 years ago
- ☆212 Updated 4 years ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23… ☆58 Updated last year
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA) ☆47 Updated last year
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles" ☆12 Updated 2 years ago
- Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction" ☆30 Updated 4 years ago
- Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text. ☆73 Updated 4 years ago
- ☆27 Updated 4 years ago
- Implemented 3 different architectures to tackle the Image Caption problem, i.e., Merged Encoder-Decoder - Bahdanau Attention - Transformer… ☆40 Updated 4 years ago
- A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of … ☆31 Updated 3 years ago
- PyTorch implementation of Emotic CNN methodology to recognize emotions in images using context information. ☆146 Updated last year
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023). ☆14 Updated 2 years ago
- PyTorch implementation of DSR-RL for Video Summarization Task ☆12 Updated 4 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T… ☆638 Updated 10 months ago