HopLee6/VJMHT-PyTorch

Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"

☆15

Alternatives and similar repositories for VJMHT-PyTorch

Users who are interested in VJMHT-PyTorch are comparing it to the repositories listed below.

HopLee6 / SSPVS-PyTorch
Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"
☆36 · Aug 26, 2025 · Updated 10 months ago
HopLee6 / VSCrowd-Dataset
Dataset for "Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark"
☆39 · Dec 9, 2025 · Updated 7 months ago
xiong-zhitong / DACM-Few-shot.pytorch
Code for Doubly deformable aggregation of covariance matrices for few-shot segmentation
☆16 · Oct 25, 2022 · Updated 3 years ago
nchucvml / STVT
Video Summarization With Spatiotemporal Vision Transformer
☆23 · Jul 5, 2023 · Updated 3 years ago
thswodnjs3 / CSTA
The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"
☆70 · Jul 27, 2025 · Updated 11 months ago
e-apostolidis / CA-SUM
A PyTorch Implementation of CA-SUM from "Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of …
☆31 · Jun 29, 2022 · Updated 4 years ago
JinhaoLee / WCA
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19 · Mar 23, 2026 · Updated 3 months ago
boheumd / A2Summ
The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)
☆86 · Apr 24, 2023 · Updated 3 years ago
jylins / videoxum
[TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos
☆53 · Apr 9, 2024 · Updated 2 years ago
iriscxy / VMSMO
Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"
☆36 · Jul 30, 2021 · Updated 4 years ago
pcshih / pytorch-VSLUD
This is the implementation of the paper Video Summarization by Learning from Unpaired Data (CVPR 2019)
☆37 · Sep 5, 2019 · Updated 6 years ago
li-plus / DSNet
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
☆223 · Sep 16, 2021 · Updated 4 years ago
HERIUN / vsumm-reinforce_re
This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…
☆11 · Jun 5, 2023 · Updated 3 years ago
ufal / MLASK
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11 · Nov 7, 2023 · Updated 2 years ago
ezeli / InSentiCap_model
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11 · Jul 18, 2022 · Updated 4 years ago
weirme / FCSN
A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"
☆117 · Jun 20, 2023 · Updated 3 years ago
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
gbc-iitd / US_UCL
[MICCAI'22] Unsupervised Contrastive Learning on Gall Bladder Ultrasound Videos
☆11 · May 28, 2023 · Updated 3 years ago
viswesh / Tweeties
Stream tweets with React, Express, Socket.io and Twitter
☆11 · Apr 6, 2018 · Updated 8 years ago
leeh43 / Singularity_Deeplesion
☆11 · Jun 5, 2021 · Updated 5 years ago
zhyu-lab / bmvae
a variational autoencoder method for clustering single-cell mutation data
☆11 · Apr 17, 2024 · Updated 2 years ago
oldprogram / OpenCV-Clock-Identification
Clock recognition built with OpenCV, mainly using the Hough transform.
☆15 · Dec 13, 2014 · Updated 11 years ago
tmlr-group / PART
[ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training"
☆17 · Jun 4, 2024 · Updated 2 years ago
longbai1006 / CAT-ViL
Official implementation of “CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surg…
☆18 · Jul 7, 2024 · Updated 2 years ago
JeunyuLi / MUAF
☆15 · Jun 27, 2023 · Updated 3 years ago
caravagnalab / rcongas
Total copy number inference from single-cell RNA and ATAC sequencing with cell clustering
☆12 · Oct 31, 2024 · Updated last year
ok1zjf / VASNet
PyTorch implementation of the ACCV 2018-AIU2018 paper Video Summarization with Attention
☆186 · Jul 16, 2022 · Updated 4 years ago
e-apostolidis / XAI-SUM
A PyTorch implementation of the software used in: "A study on the use of attention for explaining video summarization" (NarSUM Workshop a…
☆11 · Oct 20, 2023 · Updated 2 years ago
quangle2110 / GAN_Mask-RCNN
☆22 · Oct 17, 2020 · Updated 5 years ago
AlfredQin / STNet
☆17 · Jul 18, 2023 · Updated 3 years ago
zhangrh93 / InvertibleCE
Invertible Concept-based Explanation (ICE)
☆19 · Oct 29, 2025 · Updated 8 months ago
luiscarlosgph / videosum
Simple video summarisation Python package.
☆25 · Jan 29, 2024 · Updated 2 years ago
Kava-Labs / kvtool
☆12 · Feb 13, 2025 · Updated last year
Jie-su / WSCNet
Reproduce of 'Weakly Supervised Coupled Networks for Visual Sentiment Analysis'
☆13 · Nov 7, 2019 · Updated 6 years ago
wuzhe71 / STGN
☆12 · Nov 4, 2022 · Updated 3 years ago
Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14 · Mar 29, 2023 · Updated 3 years ago
Skyline-9 / Visionary-Vids
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17 · May 23, 2024 · Updated 2 years ago
vignes-12 / senior-design-project-28-wafer-defect-detection
This project aims to process 2D images of semiconductor silicon wafers to identify any defects on the wafers as well as their correspondi…
☆14 · May 9, 2023 · Updated 3 years ago
phaphuang / DSR-RL
Pytorch implementation of DSR-RL for Video Summarization Task
☆12 · Aug 30, 2021 · Updated 4 years ago