thswodnjs3/CSTA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thswodnjs3/CSTA)

thswodnjs3 / CSTA

The official code of "CSTA: CNN-based Spatiotemporal Attention for Video Summarization"

☆70

Alternatives and similar repositories for CSTA

Users that are interested in CSTA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nchucvml / STVT
View on GitHub
Video Summarization With Spatiotemporal Vision Transformer
☆23Jul 5, 2023Updated 3 years ago
HopLee6 / VJMHT-PyTorch
View on GitHub
Pytorch implementation for "Video Joint Modelling Based on Hierarchical Transformer for Co-summarization"
☆15Aug 24, 2025Updated 11 months ago
e-apostolidis / PGL-SUM
View on GitHub
A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS…
☆92Jan 30, 2023Updated 3 years ago
li-plus / DSNet
View on GitHub
DSNet: A Flexible Detect-to-Summarize Network for Video Summarization
☆223Sep 16, 2021Updated 4 years ago
StevRamos / video_summarization
View on GitHub
A computing solution based on deep learning that allows the efficient generation of keyshot type spotlights from videos.
☆19Jan 13, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
weirme / FCSN
View on GitHub
A PyTorch reimplementation of FCSN in paper "Video Summarization Using Fully Convolutional Sequence Networks"
☆117Jun 20, 2023Updated 3 years ago
Lorna-Liu / ultrasound_vsumm_RL
View on GitHub
Ultrasound Video Summarization using Deep Reinforcement Learning
☆25Oct 6, 2020Updated 5 years ago
e-apostolidis / AC-SUM-GAN
View on GitHub
A PyTorch Implementation of AC-SUM-GAN from "AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Vid…
☆28May 4, 2022Updated 4 years ago
luiscarlosgph / videosum
View on GitHub
Simple video summarisation Python package.
☆25Jan 29, 2024Updated 2 years ago
HERIUN / vsumm-reinforce_re
View on GitHub
This repo contains the Pytorch implementation of the AAAI'18 paper - Deep Reinforcement Learning for Unsupervised Video Summarization wit…
☆11Jun 5, 2023Updated 3 years ago
kkyuhun94 / dalda
View on GitHub
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
☆33Feb 6, 2026Updated 5 months ago
justarter / E2URec
View on GitHub
Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…
☆38Jul 19, 2024Updated 2 years ago
iamgmujtaba / LTC-SUM
View on GitHub
Implementation of LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN
☆22Jul 11, 2023Updated 3 years ago
m1k2zoo / negbench
View on GitHub
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆48Feb 26, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
KaiyangZhou / pytorch-vsumm-reinforce
View on GitHub
Unsupervised video summarization with deep reinforcement learning (AAAI'18)
☆505Dec 11, 2023Updated 2 years ago
Jhhuangkay / Query-controllable-Video-Summarization
View on GitHub
☆28Aug 3, 2020Updated 5 years ago
ExMorgan-Alter / NeighborNet
View on GitHub
This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.
☆29Mar 19, 2025Updated last year
BerasiDavide / vlm_image_compositionality
View on GitHub
[CVPR'25] Official implementation of the paper "Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Mo…
☆18Nov 21, 2025Updated 8 months ago
wjun0830 / CGDETR
View on GitHub
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆154Aug 21, 2024Updated last year
sylvainma / Summarizer
View on GitHub
A Video Summarization framework for implementation and benchmark of Deep Learning models
☆33Sep 9, 2024Updated last year
haozhiwen-fighting / Contrast-enhanced-Ultrasound-for-Thyroid-Nodules-Diagnosis
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
jbistanbul / hieramamba
View on GitHub
Official Code for the paper "HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling"
☆16Apr 30, 2026Updated 2 months ago
medhini / Instructional-Video-Summarization
View on GitHub
Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022
☆39Feb 17, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
MRHiSum / MR.HiSum
View on GitHub
☆56Nov 1, 2024Updated last year
mangoggul / YOLO-MultiModal
View on GitHub
☆13Oct 8, 2024Updated last year
Huntersxsx / MGPN
View on GitHub
source code of our MGPN in SIGIR 2022
☆18Jun 8, 2022Updated 4 years ago
hanghuacs / V2Xum-LLM
View on GitHub
☆27Jan 4, 2025Updated last year
DavidQiuChao / NPLIE
View on GitHub
This code is a python implementation of the paper, "Illumination Estimation for Nature Preserving Low Light Image Enhancement",in 2020.
☆12Jan 12, 2021Updated 5 years ago
j-min / HiREST
View on GitHub
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆110Jan 23, 2025Updated last year
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆39Jan 31, 2023Updated 3 years ago
ufal / MLASK
View on GitHub
EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"
☆11Nov 7, 2023Updated 2 years ago
xzc-zju / AdaVideoRAG
View on GitHub
[NeurIPS 2025] AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
☆15Jun 16, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bolajixi / Mulitimodal-Speech-Emotion-Recognition
View on GitHub
A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data
☆12May 16, 2022Updated 4 years ago
wmeiqi / C3D-R-2-1-D-R3D
View on GitHub
C3D,R(21)D,R3D--pytorch
☆10Sep 11, 2018Updated 7 years ago
linjieli222 / HERO_Video_Feature_Extractor
View on GitHub
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆118Jun 9, 2021Updated 5 years ago
EIDOSLAB / unbiased-contrastive-learning
View on GitHub
Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN
☆12Sep 22, 2023Updated 2 years ago
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
PardoAlejo / MovieCuts
View on GitHub
Learning to cut end-to-end pretrained modules
☆38Apr 17, 2025Updated last year
DAVEISHAN / TimeBalance
View on GitHub
Placeholder
☆10Jul 17, 2023Updated 3 years ago