jylins / videoxumView external linksLinks
[TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos
☆53Apr 9, 2024Updated last year
Alternatives and similar repositories for videoxum
Users that are interested in videoxum are comparing it to the libraries listed below
Sorting:
- Video Summarization With Spatiotemporal Vision Transformer☆23Jul 5, 2023Updated 2 years ago
- A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …☆31Jun 29, 2022Updated 3 years ago
- Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization☆21Jan 7, 2023Updated 3 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆85Apr 24, 2023Updated 2 years ago
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021☆21Apr 5, 2022Updated 3 years ago
- Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"☆15Aug 24, 2025Updated 5 months ago
- Simple video summarisation Python package.☆23Jan 29, 2024Updated 2 years ago
- Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"☆36Aug 26, 2025Updated 5 months ago
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)☆47Mar 21, 2024Updated last year
- A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…☆11Oct 20, 2023Updated 2 years ago
- Ultrasound Video Summarization using Deep Reinforcement Learning☆25Oct 6, 2020Updated 5 years ago
- ☆14Jul 21, 2023Updated 2 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 2 years ago
- [CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos☆37Jan 29, 2025Updated last year
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆17May 23, 2024Updated last year
- A High-level Library for Named Entity Recognition in Python.☆25Dec 7, 2023Updated 2 years ago
- A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.☆19Jan 13, 2022Updated 4 years ago
- DSNet: A Flexible Detect-to-Summarize Network for Video Summarization☆219Sep 16, 2021Updated 4 years ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆66Jun 28, 2024Updated last year
- 【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?☆253Nov 29, 2024Updated last year
- ☆203Jul 12, 2024Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Nov 2, 2023Updated 2 years ago
- A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"☆117Jun 20, 2023Updated 2 years ago
- Repository for the mijn.amsterdam.nl portal☆11Updated this week
- SPA: Efficient User-Preference Alignment against Uncertainty in Medical Image Segmentation (ICCV 2025)☆14Sep 26, 2025Updated 4 months ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- LD-Explorer is the missing tool for exploring, federating and querying linked data resources directly from the browser☆19Updated this week
- A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.☆17Aug 12, 2025Updated 6 months ago
- ☆14Nov 26, 2025Updated 2 months ago
- TVSum: Title-based Video Summarization dataset (CVPR 2015)☆134Nov 10, 2019Updated 6 years ago
- ☆10Jun 6, 2024Updated last year
- ☆10Jul 12, 2017Updated 8 years ago
- A Video Summarization framework for implementation and benchmark of Deep Learning models☆33Sep 9, 2024Updated last year
- PyTorch Implementation of Temporal Output Discrepancy for Active Learning, ICCV 2021☆40Sep 14, 2022Updated 3 years ago
- ☆48Nov 1, 2024Updated last year
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.☆168Jan 30, 2025Updated last year
- A Nigerian online store price comparison website☆12Dec 9, 2022Updated 3 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago