idiap / sharingan
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆26 Updated last year
Alternatives and similar repositories for sharingan
Users that are interested in sharingan are comparing it to the libraries listed below
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆45 Updated last year
- This work is accepted by CVPR2023 ☆36 Updated 2 years ago
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments ☆32 Updated 2 years ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers" ☆61 Updated 11 months ago
- Code for "LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model", CVPR 2024 Highlight ☆60 Updated last year
- 🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects. ☆455 Updated last week
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application… ☆25 Updated 3 years ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition ☆72 Updated last year
- A new model for gait emotion recognition ☆15 Updated last year
- Official Implementation for ICCV'23 paper Coarse-to-Fine Amodal Segmentation with Shape Prior (C2F-Seg). ☆56 Updated 2 years ago
- ☆35 Updated 10 months ago
- Official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation ☆52 Updated last year
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation ☆64 Updated last year
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su… ☆18 Updated 3 months ago
- A list of referring video object segmentation papers ☆57 Updated 8 months ago
- An official code for "A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation". ☆37 Updated 2 years ago
- ☆134 Updated last year
- Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20… ☆54 Updated 2 years ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment ☆46 Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition ☆95 Updated last year
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv… ☆134 Updated 2 years ago
- The suite of modeling video with Mamba ☆289 Updated last year
- [NeurIPS 2024] Official code for paper "EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection" ☆42 Updated 7 months ago
- GitHub repo for referring atomic video action recognition ☆20 Updated last year
- ☆257 Updated 2 years ago
- OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework ☆12 Updated 11 months ago
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction ☆29 Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024 ☆47 Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model ☆204 Updated last year
- (CVPR25) Exploring Contextual Attribute Density in Referring Expression Counting ☆19 Updated 2 months ago