theopsall / Video-SummarizationLinks
Multimodal summarization of user-generated videos from wearable cameras
☆22Updated 4 months ago
Alternatives and similar repositories for Video-Summarization
Users that are interested in Video-Summarization are comparing it to the libraries listed below
Sorting:
- Key-frame based summarization of videos☆29Updated 2 years ago
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…☆91Updated 2 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆50Updated last year
- Video Summarization With Spatiotemporal Vision Transformer☆22Updated 2 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks☆29Updated 3 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆228Updated 2 years ago
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Updated last year
- Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"☆35Updated 2 months ago
- Experimenting with different Summarizing techniques on SumMe Dataset☆141Updated 5 years ago
- Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"☆30Updated 4 years ago
- EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…☆22Updated last year
- DSNet: A Flexible Detect-to-Summarize Network for Video Summarization☆219Updated 4 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021☆21Updated 3 years ago
- It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.☆189Updated 4 months ago
- IMPLEMENT AAAI 2018 - Unsupervised video summarization with deep reinforcement learning (PyTorch)☆42Updated 4 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆143Updated 2 years ago
- Using VideoBERT to tackle video prediction☆132Updated 4 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆73Updated 4 years ago
- repo for active speaker detection for media videos.☆29Updated last year
- Video Captioning is an encoder decoder mode based on sequence to sequence learning☆137Updated last year
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)☆46Updated last year
- A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …☆31Updated 3 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆13Updated 2 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆290Updated 3 years ago
- Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.☆16Updated 2 years ago
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Updated 4 years ago
- Video to Text: Natural language description generator for some given video. [Video Captioning]☆359Updated 3 years ago
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆58Updated last year
- Implement of Video Embedding based on Tensorflow, Inception-V3 & FCNN(Frames Supported Convolution Neural Network)☆77Updated 2 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆145Updated 2 years ago