theopsall / Video-Summarization
Multimodal summarization of user-generated videos from wearable cameras
☆22Updated 2 months ago
Alternatives and similar repositories for Video-Summarization
Users who are interested in Video-Summarization are comparing it to the libraries listed below.
- Video Summarization With Spatiotemporal Vision Transformer☆21Updated 2 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆47Updated last year
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…☆90Updated 2 years ago
- A simple Python tool to extract key-frames from a video file using peak estimation from frame differences.☆184Updated 2 months ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆228Updated 2 years ago
- PyTorch implementation of unsupervised video summarization with deep reinforcement learning (AAAI 2018)☆42Updated 3 years ago
- Video captioning with an encoder-decoder model based on sequence-to-sequence learning☆138Updated last year
- Source code for the AAAI 2021 paper "Movie Summarization via Sparse Graph Construction"☆32Updated 4 years ago
- Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks☆368Updated last year
- DSNet: A Flexible Detect-to-Summarize Network for Video Summarization☆219Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆290Updated 2 years ago
- Key-frame based summarization of videos☆29Updated 2 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks☆29Updated 3 years ago
- Unimodal/Multimodal Sentiment Analysis, Emotion Recognition☆10Updated 3 years ago
- Experiments with multimodal deep learning models based on transformers☆12Updated 2 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i…☆41Updated 3 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021☆21Updated 3 years ago
- Using VideoBERT to tackle video prediction☆130Updated 4 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆142Updated 2 years ago
- PyTorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"☆35Updated last week
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆56Updated 10 months ago
- Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.☆73Updated 4 years ago
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Updated last year
- Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.☆16Updated 2 years ago
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)☆45Updated last year
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 4 years ago
- Experimenting with different Summarizing techniques on SumMe Dataset☆141Updated 5 years ago
- Video to Text: Natural language description generator for some given video. [Video Captioning]☆355Updated 3 years ago
- Keywords to Sentences☆452Updated 2 years ago
- ☆13Updated 4 years ago