TencentYoutuResearch / VisualRecognition-NomMer
Code for CVPR 2022 paper "NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition"
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for VisualRecognition-NomMer
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆12Updated 10 months ago
- ☆32Updated 11 months ago
- TCPNet☆30Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆50Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆39Updated 9 months ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆17Updated last year
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆22Updated 2 years ago
- ☆10Updated last year
- ☆23Updated last year
- This is the official code repo for "RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?"☆36Updated 2 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated 11 months ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated last year
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆45Updated 2 years ago
- The official implementation of 'Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation' (CVPR 2…☆45Updated 2 years ago
- Unofficial implementation of "SSAN: Separable Self-Attention Network for Video Representation Learning (CVPR2021)", in Pytorch☆8Updated 3 years ago
- PyTorch unofficial implementation of Graph-Based Global Reasoning (http://openaccess.thecvf.com/content_CVPR_2019/papers/Chen_Graph-Based…☆38Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 10 months ago
- [AAAI 2022] Pytorch implementation of "LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization".☆23Updated 2 years ago
- Global Reasoning unit (GloRe)☆19Updated 5 years ago
- ☆32Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆18Updated 2 years ago
- [ECCV 2022] Robust Object Detection With Inaccurate Bounding Boxes☆34Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated last year
- RSTPReid Dataset for Text-based Person Retrieval.☆24Updated 2 years ago
- code base for vision transformers☆36Updated 2 years ago
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆29Updated 3 years ago
- PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089☆66Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- RefVOS☆28Updated 3 years ago