[CVPR 2024] MMSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
☆37Jan 29, 2025Updated last year
Alternatives and similar repositories for MMSum_model
Users that are interested in MMSum_model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆86Apr 24, 2023Updated 2 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Deep learning model for supervised video summarization called Multi Source Visual Attention (MSVA)☆47Mar 21, 2024Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is the official code for CoRL 2022 "Robustness Certification of Visual Perception Models via Camera Motion Smoothing"☆11Apr 5, 2023Updated 3 years ago
- Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''☆36Jul 30, 2021Updated 4 years ago
- ☆29Jul 23, 2025Updated 8 months ago
- A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …☆30Jun 29, 2022Updated 3 years ago
- ☆17Feb 21, 2020Updated 6 years ago
- ☆23Jul 13, 2021Updated 4 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆20May 8, 2025Updated 11 months ago
- ☆10Mar 30, 2022Updated 4 years ago
- ☆13Jan 11, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Source code for the paper "Unsupervised Video Summarization via Multi-source Features" published at ICMR 2021☆21Apr 5, 2022Updated 4 years ago
- ☆13Jan 9, 2024Updated 2 years ago
- This package provides a python toolkit for the evaluation on the "SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchm…☆48Sep 5, 2023Updated 2 years ago
- ☆13Apr 2, 2025Updated last year
- ☆17Jul 18, 2023Updated 2 years ago
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆17May 23, 2024Updated last year
- Summarization of Multimodal articles☆10Oct 14, 2022Updated 3 years ago
- Registration-aided 3D Point Cloud Learning for Large-Scale Place Recognition (IROS 2021)☆11May 28, 2022Updated 3 years ago
- Code for the paper Multimodal Abstractive Summarization with Trimodal Hierarchical Attention☆20Jan 25, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the official released code for CVPR 2022 "Investigating the Impact of Multi-LiDAR Placement on Object Detection for Autonomous Dr…☆47Sep 6, 2022Updated 3 years ago
- PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention☆186Jul 16, 2022Updated 3 years ago
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- ☆14Jun 17, 2024Updated last year
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Aug 30, 2021Updated 4 years ago
- ☆10Jun 6, 2024Updated last year
- ☆25Apr 16, 2025Updated last year
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …☆248Aug 12, 2025Updated 8 months ago
- A Full-Scale Dataset for Multi-modal Summarization☆16Dec 8, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 5 years ago
- Implementation of paper - LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection☆26Apr 24, 2025Updated 11 months ago
- This is the implementation of the paper Video Summarization by Learning from Unpaired Data(CVPR2019)☆37Sep 5, 2019Updated 6 years ago
- Recent Advances in Visual Dialog☆30Aug 19, 2022Updated 3 years ago
- ☆12Oct 8, 2024Updated last year
- A reading list of papers about Visual Grounding.☆32Aug 24, 2022Updated 3 years ago