idiap / sharingan
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆18 · Updated 6 months ago
Alternatives and similar repositories for sharingan
Users who are interested in sharingan are comparing it to the repositories listed below.
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application… ☆24 · Updated 2 years ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆41 · Updated 6 months ago
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments ☆28 · Updated last year
- Official code for the CVPR 2023 paper "Source-free Adaptive Gaze Estimation by Uncertainty Reduction". ☆22 · Updated last year
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness" ☆12 · Updated last year
- A new model for gait emotion recognition ☆13 · Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio… ☆62 · Updated last week
- Code for Diffusion Action Segmentation (ICCV 2023) ☆64 · Updated last year
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization ☆18 · Updated 11 months ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers" ☆55 · Updated 3 months ago
- The project for the paper titled "MediSee: Reasoning-based Pixel-level Perception in Medical Images" ☆16 · Updated last month
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg). ☆50 · Updated last year
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers" ☆18 · Updated 6 months ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition ☆61 · Updated 5 months ago
- [IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context ☆62 · Updated last year
- Implementation of "Action Quality Assessment with Temporal Parsing Transformer" ☆21 · Updated 2 years ago
- PyTorch implementation of the paper "A Module Selection-based Approach for Efficient Skeleton Human Action Recognition" ☆12 · Updated 2 weeks ago
- ☆16 · Updated last year
- The code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment". ☆19 · Updated 2 years ago
- This work is accepted by CVPR2023 ☆36 · Updated last year
- [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation" ☆33 · Updated last year
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP ☆15 · Updated 3 months ago
- Official implementation of the paper "DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction" ☆22 · Updated last year
- SMG source code and dataset ☆17 · Updated 2 years ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024. ☆48 · Updated last year
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆26 · Updated 3 months ago
- The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning" ☆26 · Updated 4 months ago
- ☆40 · Updated last year
- ☆29 · Updated 5 months ago
- [AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition" ☆21 · Updated 2 years ago