[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"
☆54Dec 28, 2023Updated 2 years ago
Alternatives and similar repositories for CAST
Users that are interested in CAST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The app for visualizing allocated GPUs by SLURM☆13Jan 21, 2024Updated 2 years ago
- [ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"☆29Nov 15, 2025Updated 4 months ago
- Kyung Hee University Vision and Learning Reading Group☆47Apr 6, 2026Updated last week
- ☆18Jun 22, 2024Updated last year
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 9 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆26Oct 16, 2023Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆135May 21, 2023Updated 2 years ago
- 【ICCV'2023】What Can Simple Arithmetic Operations Do for Temporal Modeling?☆73Jan 26, 2024Updated 2 years ago
- [ICCV 2023] LFS-GAN: Lifelong Few-Shot Image Generation☆22Jan 3, 2024Updated 2 years ago
- Implementation of "CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection" (ICIP 2023)☆45Jul 13, 2024Updated last year
- This is a repository is an assistant to run PointNeRF. We set up a stable environment for point-nerf for ubuntu 20.04, and modified point…☆22Jun 19, 2023Updated 2 years ago
- Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"☆12Dec 21, 2022Updated 3 years ago
- ☆10Apr 20, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Jul 29, 2024Updated last year
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Jan 11, 2026Updated 3 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆94Sep 13, 2024Updated last year
- The Official Code Repo for EgoOrientBench [CVPR25]☆15Nov 24, 2025Updated 4 months ago
- ☆10Jun 2, 2024Updated last year
- PyTorch Code for Feature Boosting, Suppression, and Diversification for Fine-Grained Visual Classification☆23Apr 16, 2021Updated 4 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆771Oct 8, 2024Updated last year
- [CVPR 2024] Generative Unlearning for Any Identity☆35Feb 19, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch Implementation of Learning to Prompt (L2P) for Continual Learning @ CVPR22☆200Oct 14, 2023Updated 2 years ago
- LongShortNet for Streaming Perception task.☆13Aug 27, 2023Updated 2 years ago
- FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving☆10Jan 22, 2024Updated 2 years ago
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆21Jun 11, 2024Updated last year
- ☆11Jul 3, 2018Updated 7 years ago
- ☆42Apr 7, 2024Updated 2 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆126Jul 1, 2023Updated 2 years ago
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Jan 27, 2023Updated 3 years ago
- [CVPR 2026] Official Repository of 'MS-Temba: Multi-Scale Temporal Mamba for Understanding Long Untrimmed Videos'☆40Jan 23, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Feb 27, 2025Updated last year
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆31Nov 12, 2025Updated 5 months ago
- Distributed Optimization Infra for learning CLIP models☆29Oct 3, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 3 months ago
- 【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models☆155Sep 9, 2024Updated last year
- Anomaly detection using isolation forest☆11Apr 15, 2019Updated 7 years ago
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…☆15May 22, 2025Updated 10 months ago