Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆67May 26, 2022Updated 4 years ago
Alternatives and similar repositories for RefTR
Users that are interested in RefTR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆97Dec 2, 2022Updated 3 years ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆20Aug 17, 2021Updated 4 years ago
- ☆234Apr 13, 2023Updated 3 years ago
- ☆198Feb 27, 2024Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆57May 21, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆92Apr 15, 2022Updated 4 years ago
- ☆41Jun 3, 2022Updated 4 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆299Nov 29, 2022Updated 3 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆91Sep 30, 2021Updated 4 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 4 years ago
- ☆10Jan 9, 2025Updated last year
- [CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)☆139Aug 4, 2022Updated 3 years ago
- awesome grounding: A curated list of research papers in visual grounding☆1,125Sep 21, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A curated list of research papers in Referring Expression Comprehension (REC)☆47May 13, 2021Updated 5 years ago
- A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)☆151Nov 18, 2020Updated 5 years ago
- A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]☆241Nov 14, 2025Updated 7 months ago
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- A collection of papers about Referring Image Segmentation.☆826Jan 28, 2026Updated 5 months ago
- Referring Expression Parser☆27Feb 10, 2018Updated 8 years ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆34Jul 12, 2023Updated 2 years ago
- [CVPR2022] Official Implementation of ReferFormer☆354Feb 15, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Sep 4, 2022Updated 3 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆51Aug 31, 2021Updated 4 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆59Nov 28, 2022Updated 3 years ago
- ☆16Nov 14, 2022Updated 3 years ago
- ☆1,049Oct 3, 2022Updated 3 years ago
- An official PyTorch implementation of the CRIS paper☆281Jun 9, 2024Updated 2 years ago
- ☆47Oct 3, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch☆18Apr 7, 2020Updated 6 years ago
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆49Nov 10, 2022Updated 3 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆32Dec 7, 2023Updated 2 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆115Mar 28, 2026Updated 3 months ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Jan 18, 2023Updated 3 years ago
- ☆17Sep 15, 2024Updated last year