TencentYoutuResearch / VisualRecognition-NomMer
Code for CVPR 2022 paper "NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition"
☆25Updated 2 years ago
Alternatives and similar repositories for VisualRecognition-NomMer:
Users that are interested in VisualRecognition-NomMer are comparing it to the libraries listed below
- ☆23Updated 2 years ago
- [AAAI 2022] Pytorch implementation of "LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization".☆22Updated 2 years ago
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Updated last year
- Refer-Youtube-VOS dataset☆24Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆51Updated last year
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆53Updated last year
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…☆18Updated 4 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- The official implementaion of SPA_CVPR2021 paper☆45Updated 3 years ago
- [ECCV 2022] Robust Object Detection With Inaccurate Bounding Boxes☆34Updated last year
- Code of SSAN☆62Updated last year
- RefVOS☆29Updated 4 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated last year
- Salvage of Supervision in Weakly Supervised Object Detection, CVPR 2022☆22Updated 2 years ago
- UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection☆22Updated 4 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆47Updated 2 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated 2 years ago
- ☆26Updated 2 years ago
- ☆21Updated 3 years ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning☆99Updated 2 years ago
- ☆47Updated 2 years ago
- ☆56Updated 2 years ago
- The code of our ECCV paper: Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN☆13Updated 4 years ago
- Code release for Your “Flamingo” is My “Bird”: Fine-Grained, or Not (CVPR 2021 Oral)☆60Updated last year
- Unofficial implementation of "SSAN: Separable Self-Attention Network for Video Representation Learning (CVPR2021)", in Pytorch☆8Updated 3 years ago
- [CVPR 2022] Task-specific Inconsistency Alignment for Domain Adaptive Object Detection☆35Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- Official Implementation of AlignMixup - CVPR 2022☆71Updated 3 years ago