HERIUN / vsumm-reinforce_re
This repo contains the PyTorch implementation of the AAAI'18 paper "Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward".
☆11 · Updated last year
Alternatives and similar repositories for vsumm-reinforce_re:
Users who are interested in vsumm-reinforce_re are comparing it to the libraries listed below:
- A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of … ☆29 · Updated 2 years ago
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS… ☆87 · Updated 2 years ago
- Video Summarization With Spatiotemporal Vision Transformer ☆18 · Updated last year
- IMPLEMENT AAAI 2018 - Unsupervised video summarization with deep reinforcement learning (PyTorch) ☆42 · Updated 3 years ago
- Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning" ☆33 · Updated last year
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023) ☆15 · Updated 10 months ago
- A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a… ☆11 · Updated last year
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA) ☆42 · Updated 10 months ago
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos ☆31 · Updated 9 months ago
- [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning ☆66 · Updated 11 months ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023) ☆75 · Updated last year
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023) ☆62 · Updated 7 months ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022 ☆37 · Updated last year
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection ☆78 · Updated last year
- Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization ☆21 · Updated 2 years ago
- A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos. ☆20 · Updated 3 years ago
- The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization" ☆47 · Updated last month
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 … ☆220 · Updated last year
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023) ☆96 · Updated last week
- [ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement ☆30 · Updated last year
- [ICCV 2023] Accurate and Fast Compressed Video Captioning ☆36 · Updated 11 months ago
- [ICIP 2022 oral] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning ☆28 · Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23] ☆22 · Updated last year
- ☆69 · Updated last year
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring" ☆99 · Updated last year
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr… ☆121 · Updated 5 months ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos ☆38 · Updated 9 months ago
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆56 · Updated last year
- Papers, codes collection of video summarization / video highlight detection / video key frame selection ☆35 · Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆89 · Updated 4 months ago