iriscxy/VMSMO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/iriscxy/VMSMO)

iriscxy / VMSMO

Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''

☆36

Alternatives and similar repositories for VMSMO

Users that are interested in VMSMO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xiyan-fu / MM-AVS
View on GitHub
A Full-Scale Dataset for Multi-modal Summarization
☆16Dec 8, 2021Updated 4 years ago
amankhullar / mast
View on GitHub
Code for the paper Multimodal Abstractive Summarization with Trimodal Hierarchical Attention
☆20Jan 25, 2022Updated 4 years ago
darthgera123 / Multimodal-Summarization
View on GitHub
Summarization of Multimodal articles
☆10Oct 14, 2022Updated 3 years ago
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11Nov 7, 2023Updated 2 years ago
ZNLP / ZNLP-Dataset
View on GitHub
☆31Jul 23, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
forkarinda / MFN
View on GitHub
Multistage Fusion with Forget Gate for Multimodal Summarization in Open-Domain Videos
☆12Oct 8, 2020Updated 5 years ago
HLTCHKUST / VG-GPLMs
View on GitHub
The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".
☆57Jan 14, 2022Updated 4 years ago
xiaomin418 / CFSum
View on GitHub
☆13Jan 9, 2024Updated 2 years ago
boheumd / A2Summ
View on GitHub
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
☆86Apr 24, 2023Updated 3 years ago
hrlinlp / cepsum
View on GitHub
☆43Jun 8, 2022Updated 4 years ago
Jielin-Qiu / MMSum_model
View on GitHub
[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
☆38Jan 29, 2025Updated last year
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
v-iashin / MDVC
View on GitHub
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆144Apr 8, 2023Updated 3 years ago
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Jhhuangkay / Query-controllable-Video-Summarization
View on GitHub
☆28Aug 3, 2020Updated 5 years ago
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
HopLee6 / VJMHT-PyTorch
View on GitHub
Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"
☆15Aug 24, 2025Updated 11 months ago
thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated 2 years ago
TIBHannover / UnsupervisedVideoSummarization
View on GitHub
Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021
☆21Apr 5, 2022Updated 4 years ago
hobincar / SGN
View on GitHub
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Jul 9, 2021Updated 5 years ago
maurya-rohit / Scene-Graph-For-Videos
View on GitHub
☆15Aug 20, 2024Updated last year
jayleicn / VideoLanguageFuturePred
View on GitHub
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52Aug 20, 2022Updated 3 years ago
fireflyHunter / OpenNMT-Livebot
View on GitHub
Re-implementation of the work Livebot
☆16Jun 21, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
gsh199449 / proto-summ
View on GitHub
Dataset proposed by ''How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing''
☆18May 4, 2021Updated 5 years ago
AnnikaLindh / Diverse_and_Specific_Image_Captioning
View on GitHub
Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…
☆13May 25, 2025Updated last year
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
HopLee6 / SSPVS-PyTorch
View on GitHub
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆36Aug 26, 2025Updated 11 months ago
jayleicn / TVRetrieval
View on GitHub
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆163May 28, 2024Updated 2 years ago
eric-xw / kinetics-i3d-pytorch
View on GitHub
☆35Mar 22, 2019Updated 7 years ago
yj-yu / lsmdc
View on GitHub
☆33Nov 12, 2018Updated 7 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
srvk / how2-dataset
View on GitHub
This repository contains code and metadata of How2 dataset
☆192Dec 30, 2024Updated last year
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
frankxu2004 / cooking-procedural-extraction
View on GitHub
☆19May 2, 2020Updated 6 years ago
LuoweiZhou / detectron-vlp
View on GitHub
Detectron for image/video region feature extraction, inspired by Xinlei's repo
☆22Nov 21, 2020Updated 5 years ago
szq0214 / MSR-VTT-Challenge
View on GitHub
Video to Language Challenge (MSR-VTT Challenge 2016)
☆32Dec 28, 2017Updated 8 years ago
delchiaro / RATT
View on GitHub
☆18Oct 3, 2023Updated 2 years ago
Harryjun / pytorch-vsumm-reinforce
View on GitHub
AAAI 2018 - Unsupervised video summarization with deep reinforcement learning (PyTorch)
☆13Oct 31, 2019Updated 6 years ago