Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆26Nov 11, 2024Updated last year
Alternatives and similar repositories for sharingan
Users that are interested in sharingan are comparing it to the libraries listed below
Sorting:
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆25Oct 18, 2022Updated 3 years ago
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments☆32Oct 16, 2023Updated 2 years ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆45Dec 5, 2024Updated last year
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"☆63Mar 3, 2025Updated 11 months ago
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"☆19Dec 5, 2024Updated last year
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)☆20Nov 7, 2024Updated last year
- This is the code for ACMMM 2020 paper 'Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos'.☆24Mar 19, 2024Updated last year
- ☆10Jul 30, 2024Updated last year
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo…☆13Jun 16, 2025Updated 8 months ago
- ☆11Oct 30, 2024Updated last year
- ☆11Jun 13, 2025Updated 8 months ago
- ☆10Sep 24, 2024Updated last year
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆16Sep 29, 2025Updated 5 months ago
- CVMHT : Complementary-View Multiple Human Tracking (AAAI 2020).☆10Dec 9, 2021Updated 4 years ago
- [ICCV'23] FSI: Frequency and Spatial Interactive Learning for Image Restoration in Under-Display Cameras☆10Jan 3, 2024Updated 2 years ago
- experimenting☆12Jul 26, 2023Updated 2 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- ☆24Oct 9, 2025Updated 4 months ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- node.js bindings for Azure Speech SDK☆13Nov 18, 2025Updated 3 months ago
- API serving for your diffusers models☆11Jan 19, 2024Updated 2 years ago
- Create your own 3D scene with words anywhere.☆29Updated this week
- ☆13Nov 28, 2021Updated 4 years ago
- ☆14Jun 13, 2024Updated last year
- The public reproducible analysis code used for the gaze project☆11Feb 21, 2026Updated last week
- [ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction☆23Oct 1, 2025Updated 5 months ago
- The dataset and codes of the paper UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-Modal Learning.☆16Sep 21, 2025Updated 5 months ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- ☆11Dec 15, 2025Updated 2 months ago
- the implementation of TMNN. The paper is Dynamic Cardiac MRI Reconstruction Using Combined Tensor Nuclear Norm and Casorati Matrix Nuclea…☆11May 31, 2022Updated 3 years ago
- ☆15Dec 2, 2025Updated 2 months ago
- Official implementation and project page of the CVPR'24 paper "VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Li…☆13Aug 6, 2024Updated last year
- Image Classification Tutorial: ConvNext--> 98.8% on CIFAR10 + 92.4% on CIFAR100; ResNet18 -- 95.6% on CIFAR10 + 79.1% on CIFAR100☆13Jun 2, 2025Updated 8 months ago
- diffusers with search engine☆11Jan 13, 2026Updated last month
- The Pytorch implemetation of "FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging", AAAI 2024.☆11Mar 4, 2024Updated last year
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆57Sep 17, 2024Updated last year