chenxi-Guo / TransGOP
☆11Updated last year
Alternatives and similar repositories for TransGOP
Users that are interested in TransGOP are comparing it to the libraries listed below
- This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application…☆25Updated 3 years ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆45Updated last year
- Sharingan: A Transformer Architecture for Multi-Person Gaze Following☆26Updated last year
- The code for "Recurring the Transformer for Video Action Recognition"☆14Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T…☆26Updated 7 months ago
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness"☆14Updated last year
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆748Updated last year
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆316Updated 9 months ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆205Updated 2 years ago
- [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"☆42Updated last year
- GaitParsing: Human Semantic Parsing for Gait Recognition (IEEE TMM)☆12Updated last year
- Code release for ActionFormer (ECCV 2022)☆537Updated last year
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆266Updated 10 months ago
- [ACM MM 2025] This repository is the official implementation of the paper "Motion Matters: Motion-guided Modulation Network for Skeleton-…☆20Updated 2 months ago
- [MM2024] Learning with Alignments: Tackling the Inter- and Intra-domain Shifts for Cross-multidomain Facial Expression Recognition☆13Updated last year
- [CVPR2024 Highlight] Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images☆235Updated last year
- [Neural Networks 2025] Text-guided Image Restoration and Semantic Enhancement for Text-to-Image Person Retrieval☆11Updated last year
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆339Updated last year
- Source code of the paper Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification☆36Updated last year
- Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI …☆454Updated 2 years ago
- IEEE TMI paper: A multi-step modality fusion network for identifying the histologic subtypes of metastatic cervical lymphadenopathy☆10Updated 3 years ago
- [CVPR 2025 Highlight] PyTorch implementation of "Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-…☆136Updated 5 months ago
- Awesome Fine-grained Visual Classification☆238Updated 2 years ago
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆339Updated last year
- [ICCV2025] DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy☆41Updated 2 months ago