xmu-xiaoma666 / SDATR
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Updated 3 years ago
Alternatives and similar repositories for SDATR
Users that are interested in SDATR are comparing it to the libraries listed below
- Lightweight Transformer for Multi-modal Tasks☆16Updated 2 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- ☆57Updated 4 years ago
- Tests of different pooling methods used in CNNs for computer vision tasks☆35Updated 4 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆55Updated last year
- PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089☆69Updated 3 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated last year
- Official code for CVPR 2022 paper "Relieving Long-tailed Instance Segmentation via Pairwise Class Balance".☆37Updated 3 years ago
- Dynamic Early Exit for Image Captioning☆17Updated 3 years ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆39Updated 2 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆53Updated 2 years ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated 2 years ago
- Towards Local Visual Modeling for Image Captioning☆29Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- ☆43Updated 4 years ago
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers"☆79Updated last year
- [CVPR 2023] This is the official PyTorch implementation for "Dynamic Focus-aware Positional Queries for Semantic Segmentation".☆60Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 3 years ago
- Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning☆20Updated last year
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 2 years ago
- ☆34Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30Updated 2 years ago
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Updated 3 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆31Updated 3 years ago
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689☆53Updated 3 years ago
- Official Implementation of "FP-DETR: Detection Transformer Advanced by Fully Pre-training"☆63Updated 3 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8☆28Updated 4 years ago