xuyankun / VSViG
Official implementation of VSViG
☆19Updated 3 months ago
Alternatives and similar repositories for VSViG
Users who are interested in VSViG are comparing it to the libraries listed below:
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting☆19Updated 2 years ago
- [MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra…☆36Updated 3 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆79Updated 3 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆23Updated 2 years ago
- Gait Abnormality in Video Dataset (GAVD) is the largest collection of online links to gait videos with clinical annotations. The dataset …☆30Updated 2 months ago
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆64Updated 2 years ago
- Coach-Project☆15Updated 2 months ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆30Updated 3 months ago
- The extension of this dataset (APTv2) can be found at:☆55Updated 2 years ago
- Code for GaitForeMer.☆21Updated 2 years ago
- ☆13Updated last year
- [ICCV 2023] Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity☆101Updated last year
- [ICCV 25] The official repository of paper 'Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle'☆188Updated last week
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆63Updated 9 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆30Updated 11 months ago
- Multi-View Operating Room (MVOR) dataset consists of synchronized multi-view frames recorded by three RGB-D cameras in a hybrid OR during…☆69Updated 8 months ago
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.☆19Updated 9 months ago
- [NeurIPS 2024 Workshop AIM-FM] Official code implementation for paper: Surgical SAM 2☆68Updated 9 months ago
- A new model for gait emotion recognition☆15Updated last year
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following☆26Updated last year
- [CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding☆153Updated last year
- The official repo for the extension of [NeurIPS'22] "APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking": https://g…☆28Updated last year
- Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activiti…☆24Updated 3 weeks ago
- An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis (CVPR'21)☆47Updated 3 years ago
- ☆32Updated 2 years ago
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆49Updated 2 months ago
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning☆40Updated 5 months ago
- Video-based gait analysis for assessing Alzheimer’s Disease and Dementia with Lewy Bodies☆17Updated last year
- [IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation☆29Updated 3 years ago
- Official code of the paper MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environ…☆52Updated 5 months ago