xuyankun / VSViGLinks
Official implementation of VSViG
☆15Updated 7 months ago
Alternatives and similar repositories for VSViG
Users that are interested in VSViG are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆25Updated 5 months ago
- This repo is official implementation of the paper "Multimodal transformer for Nurse Activity Recognition", published in CVPM2022, CVPRW.☆18Updated last year
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆62Updated 2 years ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆52Updated last year
- Placeholder☆10Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆38Updated last year
- [ICCV 2023] Data-Free Class-Incremental Hand Gesture Recognition☆14Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- ☆12Updated 10 months ago
- Coach-Project☆13Updated 5 months ago
- Official Repository of 'Multi-Scale Temporal Mamba for Efficient Temporal Action Detection'☆22Updated this week
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting☆16Updated 2 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆54Updated 2 months ago
- ☆23Updated last year
- [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…☆20Updated last year
- ☆13Updated last month
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆41Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- [TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver☆19Updated 9 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆27Updated 4 months ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆20Updated last year
- [ISBI 2025] XLSTM-HVED: Cross-Modal Brain Tumor Segmentation and MRI Reconstruction Method Using Vision XLSTM and Heteromodal Variational…☆14Updated last week
- Official code for NeurIPS 2023 paper "Self-Supervised Motion Magnification by Backpropagating Through Optical Flow"☆34Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆70Updated last month
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆26Updated 2 months ago
- QAFE-Net: Quality Assessment of Facial Expressions with Landmark Heatmaps☆13Updated last year
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆45Updated 7 months ago
- An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis (CVPR'21)☆44Updated 2 years ago
- ☆19Updated 2 months ago
- Gait Abnormality in Video Dataset (GAVD) is the largest collection of online links to gait videos with clinical annotations. The dataset …☆16Updated 3 months ago