idiap / sharinganLinks
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆23Updated last year
Alternatives and similar repositories for sharingan
Users that are interested in sharingan are comparing it to the libraries listed below
Sorting:
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆45Updated last year
- This work is accepted by CVPR2023☆36Updated 2 years ago
- ☆33Updated 9 months ago
- A new model for gait emotion recognition☆15Updated last year
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134Updated 2 years ago
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments☆31Updated 2 years ago
- PoseRAC: Pose Saliency Transformer for Repetitive Action Counting☆17Updated 2 years ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆46Updated last year
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg).☆56Updated 2 years ago
- 🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.☆443Updated 2 weeks ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆25Updated 3 years ago
- Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight☆59Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆82Updated 7 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆46Updated last year
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆18Updated 2 months ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Updated last year
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆72Updated last year
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆64Updated last year
- ☆16Updated last year
- ☆256Updated 2 years ago
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.☆58Updated last year
- ☆33Updated 6 months ago
- Official pytorch implementation of MuST: Multi-Scale Transformers for Surgical Phase Recognition MICCAI 2024☆13Updated 11 months ago
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness"☆14Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆94Updated 11 months ago
- Awesome Action Quality Assessment (AQA)☆113Updated last week
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆55Updated 2 years ago
- A curated list of Action Quality Assessment and related area resources☆26Updated 4 months ago
- Easy wrapper for inserting LoRA layers in CLIP.☆40Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023)☆72Updated 2 years ago