KHU-VLL/CAST

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KHU-VLL/CAST)

KHU-VLL / CAST

[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"

☆55

Alternatives and similar repositories for CAST

Users that are interested in CAST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Haawron / SLURM_allocated_gres_visualizer
View on GitHub
The app for visualizing allocated GPUs by SLURM
☆13Jan 21, 2024Updated 2 years ago
YorkUCVIL / VTCD
View on GitHub
☆19Jun 22, 2024Updated 2 years ago
naver-ai / class-query-vad
View on GitHub
[ECCV 2024] Official PyTorch implementation of "Classification Matters: Improving Video Action Detection with Class-Specific Attention"
☆18Nov 8, 2024Updated last year
KHU-VLL / KHU_Vision_and_Learning_Reading_Group
View on GitHub
Kyung Hee University Vision and Learning Reading Group
☆49Updated this week
KHU-VLL / DEVIAS
View on GitHub
[ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"
☆29Nov 15, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
MCG-NJU / MGMAE
View on GitHub
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26Oct 16, 2023Updated 2 years ago
ruiwang2021 / mvd
View on GitHub
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135May 21, 2023Updated 3 years ago
whwu95 / ATM
View on GitHub
【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?
☆74Jan 26, 2024Updated 2 years ago
dehezhang2 / PointNeRF-Assistant
View on GitHub
This is a repository is an assistant to run PointNeRF. We set up a stable environment for point-nerf for ubuntu 20.04, and modified point…
☆22Jun 19, 2023Updated 3 years ago
IntelligentNetworkingLAB / Deep-Learning-Model-Generator
View on GitHub
☆21Nov 28, 2022Updated 3 years ago
denfed / heartheflow
View on GitHub
Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"
☆12Dec 21, 2022Updated 3 years ago
Francis-Rings / ILA
View on GitHub
[ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition
☆41Nov 29, 2023Updated 2 years ago
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
KU-VGI / HMDC
View on GitHub
Official Repository for Heterogeneous Models Dataset Condensation (ECCV 2024, Oral)
☆10Dec 15, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
VisualAIKHU / SIRA-SSL
View on GitHub
Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023)
☆18Nov 14, 2023Updated 2 years ago
leexinhao / ZeroI2V
View on GitHub
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20Jul 29, 2024Updated last year
OpenGVLab / VideoMAEv2
View on GitHub
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆804Oct 8, 2024Updated last year
daniel-code / TubeViT
View on GitHub
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95Jul 15, 2026Updated last week
jhCOR / EgoOrientBench
View on GitHub
The Official Code Repo for EgoOrientBench [CVPR25]
☆17Nov 24, 2025Updated 8 months ago
MCG-NJU / AMD
View on GitHub
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
☆18Jan 11, 2026Updated 6 months ago
lisiqi19971013 / SuperFast
View on GitHub
SuperFast: 200× Video Frame Interpolation via Event Camera
☆26Apr 19, 2024Updated 2 years ago
Sarinda251 / CDFSL-V
View on GitHub
Accepted at ICCV '23
☆16Oct 4, 2023Updated 2 years ago
junkwhinger / adversarial_complementary_learning
View on GitHub
☆11Jul 3, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
JH-LEE-KR / l2p-pytorch
View on GitHub
PyTorch Implementation of Learning to Prompt (L2P) for Continual Learning @ CVPR22
☆204Oct 14, 2023Updated 2 years ago
ZhonghuaYi / FocusFlow_official
View on GitHub
FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving
☆11Jan 22, 2024Updated 2 years ago
LiChenyang-Github / LongShortNet
View on GitHub
LongShortNet for Streaming Perception task.
☆13Aug 27, 2023Updated 2 years ago
zyxia1009 / CVPR2024-TSPNet
View on GitHub
(CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization
☆20Jun 11, 2024Updated 2 years ago
dominickrei / pi-vit
View on GitHub
[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
☆31Nov 12, 2025Updated 8 months ago
HJYao00 / Side4Video
View on GitHub
☆42Apr 7, 2024Updated 2 years ago
TalalWasim / Vita-CLIP
View on GitHub
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126Jul 1, 2023Updated 3 years ago
HighwayWu / ST-DDL
View on GitHub
☆16Feb 27, 2025Updated last year
Optimization-AI / FastCLIP
View on GitHub
Distributed Optimization Infra for learning CLIP models
☆31Oct 3, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
musicalOffering / ActionSwitch-release
View on GitHub
☆12Aug 7, 2024Updated last year
whwu95 / BIKE
View on GitHub
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156Sep 9, 2024Updated last year
basiclab / FreeCond
View on GitHub
FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…
☆15May 22, 2025Updated last year
Divya-Bhargavi / isolation-forest
View on GitHub
Anomaly detection using isolation forest
☆11Apr 15, 2019Updated 7 years ago
thearkaprava / MS-Temba
View on GitHub
[CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'
☆48Jun 22, 2026Updated last month
rulixiang / vwe
View on GitHub
[IJCAI 2021 & IJCV 2022] Learning Visual Words for Weakly-Supervised Semantic Segmentation
☆28Feb 11, 2022Updated 4 years ago
DAVEISHAN / TimeBalance
View on GitHub
Placeholder
☆10Jul 17, 2023Updated 3 years ago