francescotonini / human-gaze-target-detection-transformer
An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"
☆17Updated 3 months ago
Alternatives and similar repositories for human-gaze-target-detection-transformer:
Users that are interested in human-gaze-target-detection-transformer are comparing it to the libraries listed below
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆39Updated 3 months ago
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆24Updated 2 years ago
- [TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver☆16Updated 6 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆32Updated last year
- ☆55Updated 8 months ago
- SMG source code and dataset☆16Updated last year
- [Heliyon 2023] The implementation of paper "Activity Recognition in Children with Autism-Related Behaviors"☆19Updated 2 years ago
- ☆132Updated last year
- ☆50Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition☆122Updated 4 months ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆37Updated last year
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆57Updated 3 months ago
- [IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context☆56Updated last year
- code for: POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition☆55Updated last year
- 【CVPR2023】GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments☆28Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆44Updated last year
- Dual Attention Guided Gaze Target Detection in the Wild☆20Updated 2 years ago
- [TIP'21] Learning Deep Global Multi-scale and Local Attention Features for Facial Expression Recognition in the Wild☆87Updated last year
- Official implementation of the NeurIPS2023 paper: Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial Expression Recognit…☆27Updated last year
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness"☆11Updated last year
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆63Updated last year
- ☆11Updated last month
- [CVPR 2023] Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video☆58Updated last year
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following☆14Updated 4 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆96Updated 11 months ago
- ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition☆43Updated last year
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆31Updated last year
- ☆12Updated 2 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆27Updated 4 months ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆63Updated 3 weeks ago