idiap / sharingan
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆21 · Updated 8 months ago
Alternatives and similar repositories for sharingan
Users who are interested in sharingan are comparing it to the repositories listed below.
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆41 · Updated 7 months ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application… ☆24 · Updated 2 years ago
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio… ☆70 · Updated last month
- This work is accepted by CVPR2023 ☆36 · Updated last year
- A new model for gait emotion recognition ☆13 · Updated last year
- Official PyTorch repository for GRAM ☆80 · Updated 2 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024. ☆48 · Updated last year
- Curated list of video object segmentation (VOS) papers, datasets, and projects. ☆352 · Updated this week
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization ☆19 · Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023) ☆64 · Updated last year
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness" ☆13 · Updated last year
- Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025) ☆22 · Updated 2 months ago
- [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation" ☆36 · Updated last year
- ☆252 · Updated last year
- Official repository of the GraSP dataset and implementation of TAPIS ☆32 · Updated 6 months ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking ☆656 · Updated 9 months ago
- Implementation of the CVPR 2024 paper "A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation" ☆19 · Updated 6 months ago
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch. ☆273 · Updated 2 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv… ☆132 · Updated 2 years ago
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su… ☆12 · Updated 4 months ago
- [CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation. ☆165 · Updated 11 months ago
- An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPR… ☆221 · Updated last month
- ☆16 · Updated last year
- A curated list of awesome temporal action segmentation resources. ☆210 · Updated last year
- The suite of modeling video with Mamba ☆277 · Updated last year
- Awesome Action Quality Assessment (AQA) ☆75 · Updated last month
- [ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition ☆291 · Updated last year
- ☆13 · Updated last year
- Implementation of "Action Quality Assessment with Temporal Parsing Transformer" ☆21 · Updated 2 years ago
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg). ☆53 · Updated last year