xmu-xiaoma666 / SDATR
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19 Updated 2 years ago
Alternatives and similar repositories for SDATR
Users who are interested in SDATR are comparing it to the repositories listed below
- Lightweight Transformer for Multi-modal Tasks ☆16 Updated 2 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆56 Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023 ☆53 Updated last year
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM) ☆38 Updated 2 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation ☆48 Updated last year
- Test different pooling method used in CNN for Computer Vision Task ☆35 Updated 4 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆85 Updated 2 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆93 Updated last year
- PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089 ☆68 Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders ☆56 Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆37 Updated last year
- Towards Local Visual Modeling for Image Captioning ☆29 Updated 2 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers" ☆97 Updated 3 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding ☆142 Updated 10 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆105 Updated 2 years ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation ☆23 Updated 3 years ago
- Official Implementation of DE-DETR and DELA-DETR in "Towards Data-Efficient Detection Transformers" ☆78 Updated last year
- ☆57 Updated 3 years ago
- Dynamic Early Exit for Image Captioning ☆17 Updated 2 years ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023) ☆32 Updated 2 years ago
- ☆40 Updated 3 years ago
- ☆22 Updated 3 years ago
- code base for vision transformers ☆36 Updated 3 years ago
- Official Implementation of "FP-DETR: Detection Transformer Advanced by Fully Pre-training" ☆63 Updated 3 years ago
- Official Pytorch implementation of "Visual Recognition with Deep Nearest Centroids". (ICLR2023 Spotlight) ☆66 Updated 2 years ago
- ☆40 Updated last year
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control" ☆52 Updated last year
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆28 Updated 2 years ago
- ☆25 Updated 2 years ago
- ☆59 Updated 3 years ago