xmu-xiaoma666 / SDATR
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19 Updated 2 years ago
Alternatives and similar repositories for SDATR
Users who are interested in SDATR are comparing it to the repositories listed below:
- Lightweight Transformer for Multi-modal Tasks ☆16 Updated 2 years ago
- Dynamic Early Exit for Image Captioning ☆17 Updated 2 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆57 Updated 2 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers" ☆97 Updated 3 years ago
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control" ☆52 Updated 2 years ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM) ☆38 Updated 2 years ago
- Official Implementation of "FP-DETR: Detection Transformer Advanced by Fully Pre-training" ☆63 Updated 3 years ago
- ☆57 Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders ☆57 Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆37 Updated 2 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS) ☆53 Updated last year
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast… ☆30 Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task ☆35 Updated 4 years ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023) ☆32 Updated 2 years ago
- PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089 ☆69 Updated 3 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆93 Updated last year
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers" ☆79 Updated last year
- MMdet2-based repository about lightweight detection model: Nanodet, PicoDet. Also including detection knowledge distillation method ☆14 Updated 3 years ago
- code base for vision transformers ☆36 Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆33 Updated 3 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8 ☆27 Updated 4 years ago
- Towards Local Visual Modeling for Image Captioning ☆29 Updated 2 years ago
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters ☆19 Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers ☆26 Updated 3 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆85 Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023 ☆55 Updated last year
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022. ☆28 Updated 3 years ago
- ☆43 Updated 4 years ago
- Official code for CVPR 2022 paper "Relieving Long-tailed Instance Segmentation via Pairwise Class Balance". ☆37 Updated 3 years ago
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation". ☆60 Updated 2 years ago