usr922 / vgtr
[ICME'22] Visual Grounding with Transformers
☆28Updated 2 years ago
Alternatives and similar repositories for vgtr:
Users that are interested in vgtr are comparing it to the libraries listed below
- ☆34Updated last year
- ☆38Updated last year
- SeqTR: A Simple yet Universal Network for Visual Grounding☆131Updated 2 months ago
- [ICCV2023] PyTorch implementation of ''Spatial-Aware Token for Weakly Supervised Object Localization''.☆19Updated last year
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆66Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆52Updated 2 years ago
- ☆89Updated last year
- What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs☆23Updated 2 years ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆66Updated 3 years ago
- An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".☆51Updated 3 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆23Updated last month
- (TIP 2024) Towards Robust Referring Image Segmentation☆26Updated 10 months ago
- This repo is the official implementation of UPL (Unsupervised Prompt Learning for Vision-Language Models).☆109Updated 2 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆94Updated 2 years ago
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)☆48Updated 2 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆48Updated 11 months ago
- ☆54Updated last year
- Weakly-Supervised-Learning, Semantic Segmentation, CVPR 2023☆62Updated last year
- ☆56Updated 2 years ago
- Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning" (published at ICLR 202…☆57Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆144Updated 6 months ago
- Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))☆54Updated last year
- ☆28Updated 2 years ago
- ☆34Updated 2 years ago
- [CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .☆20Updated 2 years ago
- Multi-label Image Recognition with Partial Labels (IJCV'24, ESWA'24, AAAI'22)☆35Updated 6 months ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆86Updated last year
- [ECCV'22 Poster] Explicit Image Caption Editing☆21Updated 2 years ago
- PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"☆44Updated 3 months ago