idiap / sharingan
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆11Updated 2 months ago
Alternatives and similar repositories for sharingan:
Users that are interested in sharingan are comparing it to the libraries listed below
- Positive-Negative Equal Contrastive Loss for Semantic Segmentation☆12Updated last year
- Long Surgical Phase Recognition☆17Updated 2 months ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆58Updated last year
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆37Updated last month
- This work is accepted by CVPR2023☆36Updated last year
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆23Updated 5 months ago
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).☆49Updated last year
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆19Updated 7 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆47Updated 9 months ago
- [AAAI24] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"☆28Updated 10 months ago
- Multimodal Large Models Are Effective Action Anticipators (IEEE TMM)🌳☆15Updated 2 weeks ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆23Updated 2 years ago
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness"☆10Updated 10 months ago
- Official repository of the GraSP dataset and implemention of TAPIS☆20Updated 2 weeks ago
- ☆53Updated 11 months ago
- [CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.☆140Updated 5 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆115Updated last year
- A new model for gait emotion recognition☆13Updated 9 months ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆299Updated 9 months ago
- [MICCAI 2024] Surgformer: Surgical Transformer with Hierarchical Temporal Attention for Surgical Phase Recognition☆23Updated last week
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆43Updated last week
- Self-Supervised Video Representation Learning with Motion-Aware Masked Autoencoders☆23Updated 5 months ago
- ☆30Updated 7 months ago
- ☆12Updated 4 months ago
- Adaptive FSS has been Accepted by AAAI 2024. A Novel Few-Shot Segmentation Framework via Prototype Enhancement☆36Updated 10 months ago
- 【AAAI 2022】Temporal Action Proposal Generation with Background Constraint☆18Updated 2 years ago
- Official webpage for TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection, accepted at…☆16Updated 2 weeks ago
- implementation of "Action Quality Assessment with Temporal Parsing Transformer"☆19Updated 2 years ago
- Implementation of the CVPR 2024 paper "A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation"☆17Updated last week
- A curated list of Action Quality Assessment and related area resources☆21Updated last week