Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆29 · Nov 11, 2024 · Updated last year
Alternatives and similar repositories for sharingan
Users interested in sharingan are comparing it to the libraries listed below.
- ☆13 · Apr 26, 2024 · Updated 2 years ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆46 · Dec 5, 2024 · Updated last year
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments ☆32 · Oct 16, 2023 · Updated 2 years ago
- Code for the CVPRW GAZE 2021 paper -- GOO : A Dataset for Gaze Object Prediction in Retail Environments ☆51 · Apr 23, 2024 · Updated 2 years ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers" ☆63 · Mar 3, 2025 · Updated last year
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers" ☆20 · Dec 5, 2024 · Updated last year
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight) ☆836 · Mar 18, 2026 · Updated last month
- This repository is the official implementation of GaTector, which studies the newly proposed task, gaze object prediction. In this work, … ☆60 · Sep 11, 2023 · Updated 2 years ago
- Repository for 3DV2022 paper "Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery" ☆19 · Mar 22, 2023 · Updated 3 years ago
- Paper reading: Jamba — Hybrid Transformer-Mamba LM (SSM → S4 → S6 → Jamba) ☆15 · May 22, 2024 · Updated last year
- What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation ☆49 · Aug 12, 2024 · Updated last year
- This is the code for ACMMM 2020 paper 'Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos'. ☆26 · Mar 19, 2024 · Updated 2 years ago
- Code for ACCV2018 paper 'Believe It or Not, We Know What You Are Looking at!' ☆112 · Jul 9, 2021 · Updated 4 years ago
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)" ☆14 · Dec 13, 2024 · Updated last year
- ☆54 · Jan 20, 2024 · Updated 2 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- Multimodal Large Models Are Effective Action Anticipators (IEEE TMM)🌳 ☆26 · Aug 15, 2025 · Updated 8 months ago
- ☆11 · Oct 13, 2024 · Updated last year
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi) ☆15 · Aug 6, 2020 · Updated 5 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023 ☆11 · Oct 5, 2023 · Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning ☆13 · Apr 14, 2024 · Updated 2 years ago
- Reconstruction of highly undersampled radial cardiac MRI with a U-Net ☆11 · Apr 4, 2020 · Updated 6 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection" ☆15 · Aug 30, 2023 · Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me… ☆13 · May 25, 2023 · Updated 2 years ago
- ☆12 · Aug 16, 2019 · Updated 6 years ago
- Official code for the paper "Understanding Co-speech Gestures in-the-wild" ☆24 · Oct 31, 2025 · Updated 6 months ago
- ☆10 · Apr 7, 2025 · Updated last year
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition ☆17 · Sep 29, 2025 · Updated 7 months ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning". ☆10 · Aug 14, 2024 · Updated last year
- LAEO-Net++ ☆21 · Mar 24, 2021 · Updated 5 years ago
- ☆24 · Aug 9, 2025 · Updated 8 months ago
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning" ☆97 · Mar 21, 2024 · Updated 2 years ago
- Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral) ☆18 · Jun 23, 2024 · Updated last year
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness" ☆15 · Mar 8, 2024 · Updated 2 years ago
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet) ☆14 · Jan 10, 2024 · Updated 2 years ago
- ☆11 · Jun 13, 2025 · Updated 10 months ago
- ☆119 · Feb 19, 2024 · Updated 2 years ago
- Deformable Cross-Attention Transformer for Medical Image Registration (PyTorch) ☆20 · Apr 30, 2025 · Updated last year
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy ☆44 · Nov 21, 2025 · Updated 5 months ago