boheumd/A2Summ

The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)

☆86

Alternatives and similar repositories for A2Summ

Users who are interested in A2Summ are comparing it to the repositories listed below.

HopLee6 / SSPVS-PyTorch
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆36 · Aug 26, 2025 · Updated 11 months ago
thswodnjs3 / CSTA
The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"
☆70 · Jul 27, 2025 · Updated last year
xiyan-fu / MM-AVS
A Full-Scale Dataset for Multi-modal Summarization
☆16 · Dec 8, 2021 · Updated 4 years ago
Skyline-9 / Visionary-Vids
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17 · May 23, 2024 · Updated 2 years ago
e-apostolidis / CA-SUM
A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …
☆31 · Jun 29, 2022 · Updated 4 years ago
iriscxy / VMSMO
Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"
☆36 · Jul 30, 2021 · Updated 4 years ago
e-apostolidis / PGL-SUM
A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…
☆92 · Jan 30, 2023 · Updated 3 years ago
TIBHannover / MSVA
Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)
☆47 · Mar 21, 2024 · Updated 2 years ago
jnzs1836 / intent-vizor
☆16 · Jul 10, 2024 · Updated 2 years ago
StevRamos / video_summarization
A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.
☆19 · Jan 13, 2022 · Updated 4 years ago
TIBHannover / UnsupervisedVideoSummarization
Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021
☆21 · Apr 5, 2022 · Updated 4 years ago
fjchange / Awesome_Video_Summarization
Papers, codes collection of video summarization / video highlight detection / video key frame selection
☆37 · Jul 16, 2021 · Updated 5 years ago
HopLee6 / VJMHT-PyTorch
Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"
☆15 · Aug 24, 2025 · Updated 11 months ago
XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11 · May 16, 2023 · Updated 3 years ago
medhini / Instructional-Video-Summarization
Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022
☆39 · Feb 17, 2023 · Updated 3 years ago
li-plus / DSNet
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
☆223 · Sep 16, 2021 · Updated 4 years ago
weirme / FCSN
A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"
☆117 · Jun 20, 2023 · Updated 3 years ago
e-apostolidis / XAI-SUM
A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…
☆11 · Oct 20, 2023 · Updated 2 years ago
MRHiSum / MR.HiSum
☆56 · Nov 1, 2024 · Updated last year
e-apostolidis / PoR-Summarization-Measure
A python implementation for computing the PoR metric for video summarization from "Performance over Random: A Robust Evaluation Protocol …
☆10 · May 4, 2022 · Updated 4 years ago
TencentARC / UMT
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …
☆238 · Apr 15, 2024 · Updated 2 years ago
Jielin-Qiu / MMWatermark-Robustness
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆12 · Jun 7, 2024 · Updated 2 years ago
Jielin-Qiu / Transfer_Knowledge_from_Language_to_ECG
[EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…
☆18 · May 7, 2024 · Updated 2 years ago
HLTCHKUST / VG-GPLMs
The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".
☆57 · Jan 14, 2022 · Updated 4 years ago
theopsall / Video-Summarization
Multimodal summarization of user-generated videos from wearable cameras
☆23 · Jun 22, 2025 · Updated last year
ZNLP / ZNLP-Dataset
☆31 · Jul 23, 2025 · Updated last year
j-min / HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆110 · Jan 23, 2025 · Updated last year
wjun0830 / CGDETR
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆154 · Aug 21, 2024 · Updated last year
luiscarlosgph / videosum
Simple video summarisation Python package.
☆25 · Jan 29, 2024 · Updated 2 years ago
4paradigm-CV / SE-STAD
☆10 · Jan 3, 2023 · Updated 3 years ago
Janie1996 / MSRFG
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11 · Jan 17, 2023 · Updated 3 years ago
showlab / UniVTG
[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding
☆380 · May 8, 2024 · Updated 2 years ago
HERIUN / vsumm-reinforce_re
This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…
☆11 · Jun 5, 2023 · Updated 3 years ago
CrossmodalGroup / ESL
☆12 · May 3, 2024 · Updated 2 years ago
xiaomi1024 / code_SAMS
☆13 · Jan 11, 2024 · Updated 2 years ago
RiTUAL-MBZUAI / Font-prediction-dataset
This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"
☆11 · May 5, 2020 · Updated 6 years ago
nc-ai / MultimodalSum
[ACL-IJCNLP 2021] Self-Supervised Multimodal Opinion Summarization
☆25 · Apr 6, 2024 · Updated 2 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26 · Jun 28, 2021 · Updated 5 years ago
aspirinone / CATR.github.io
☆31 · Mar 1, 2024 · Updated 2 years ago