Jielin-Qiu/MMSum_model

[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

☆38

Alternatives and similar repositories for MMSum_model

Users who are interested in MMSum_model are comparing it to the libraries listed below.

medhini / Instructional-Video-Summarization
Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022
☆39 · Feb 17, 2023 · Updated 3 years ago
ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11 · Nov 7, 2023 · Updated 2 years ago
Jielin-Qiu / MMWatermark-Robustness
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆12 · Jun 7, 2024 · Updated 2 years ago
HopLee6 / SSPVS-PyTorch
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆36 · Aug 26, 2025 · Updated 11 months ago
Jielin-Qiu / MM_Robustness
[DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift
☆39 · Jan 25, 2024 · Updated 2 years ago
Jiacheng-Zhu-AIML / AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆40 · Feb 27, 2024 · Updated 2 years ago
iriscxy / VMSMO
Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"
☆36 · Jul 30, 2021 · Updated 4 years ago
ZNLP / ZNLP-Dataset
☆31 · Jul 23, 2025 · Updated last year
e-apostolidis / PGL-SUM
A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…
☆92 · Jan 30, 2023 · Updated 3 years ago
Jielin-Qiu / Transfer_Knowledge_from_Language_to_ECG
[EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…
☆18 · May 7, 2024 · Updated 2 years ago
e-apostolidis / CA-SUM
A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …
☆31 · Jun 29, 2022 · Updated 4 years ago
e-apostolidis / XAI-SUM
A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…
☆11 · Oct 20, 2023 · Updated 2 years ago
InterDigitalInc / DialogSummary-VideoQA
☆10 · Mar 30, 2022 · Updated 4 years ago
pangzss / pytorch-CTVSUM
Pytorch code for paper Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization
☆22 · Jan 7, 2023 · Updated 3 years ago
jylins / videoxum
[TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos
☆53 · Apr 9, 2024 · Updated 2 years ago
xiaomin418 / CFSum
☆13 · Jan 9, 2024 · Updated 2 years ago
Skyline-9 / Visionary-Vids
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17 · May 23, 2024 · Updated 2 years ago
xiaomi1024 / code_SAMS
☆13 · Jan 11, 2024 · Updated 2 years ago
SeasonDepth / SeasonDepth
This package provides a python toolkit for the evaluation on the "SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchm…
☆48 · Sep 5, 2023 · Updated 2 years ago
201983290498 / lddu_mmer
☆13 · Apr 2, 2025 · Updated last year
AlfredQin / STNet
☆17 · Jul 18, 2023 · Updated 3 years ago
e-apostolidis / Video-Thumbnail-Selector
A PyTorch Implementation of the Video Thumbnail Selector from "Combining Adversarial and Reinforcement Learning for Video Thumbnail Selec…
☆18 · May 30, 2022 · Updated 4 years ago
amankhullar / mast
Code for the paper Multimodal Abstractive Summarization with Trimodal Hierarchical Attention
☆20 · Jan 25, 2022 · Updated 4 years ago
qiaozhijian / vLPD-Net
Registration-aided 3D Point Cloud Learning for Large-Scale Place Recognition (IROS 2021)
☆11 · May 28, 2022 · Updated 4 years ago
ok1zjf / VASNet
PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention
☆186 · Jul 16, 2022 · Updated 4 years ago
laurimi / multiagent-prediction-reward
Multi-agent active perception with prediction rewards
☆12 · Nov 13, 2020 · Updated 5 years ago
phaphuang / DSR-RL
Pytorch implementation of DSR-RL for Video Summarization Task
☆12 · Aug 30, 2021 · Updated 4 years ago
Mingqj / OcRFDet
[ICCV 2025] OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving
☆15 · Jun 17, 2026 · Updated last month
NeuronXJTU / KFGNet
☆23 · Sep 19, 2023 · Updated 2 years ago
chengzju / CARAT
☆25 · Apr 16, 2025 · Updated last year
mangoggul / YOLO-MultiModal
☆13 · Oct 8, 2024 · Updated last year
littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆27 · Nov 13, 2023 · Updated 2 years ago
VincentYuuuuuu / LSM-YOLO
Implementation of paper - LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection
☆26 · Apr 24, 2025 · Updated last year
baoqianyue / DFC2021-Track-MSD
Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD
☆10 · Mar 31, 2021 · Updated 5 years ago
Jiacheng-Zhu-AIML / WGPOT
The Wasserstein Distance and Optimal Transport Map of Gaussian Processes
☆51 · Aug 3, 2020 · Updated 5 years ago
HenryLHH / fusion
This is the source code of FUSION, a safety-aware causal representation for generalizable driving agents.
☆28 · Oct 23, 2024 · Updated last year
sokcertifiedrobustness / VeriGauge-deprecated
☆11 · Oct 18, 2022 · Updated 3 years ago
HERIUN / vsumm-reinforce_re
This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…
☆11 · Jun 5, 2023 · Updated 3 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
☆14 · Jul 13, 2021 · Updated 5 years ago