Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆68May 26, 2022Updated 3 years ago
Alternatives and similar repositories for RefTR
Users that are interested in RefTR are comparing it to the libraries listed below
Sorting:
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Dec 2, 2022Updated 3 years ago
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021☆19Aug 17, 2021Updated 4 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆57May 21, 2022Updated 3 years ago
- ☆221Apr 13, 2023Updated 2 years ago
- ☆195Feb 27, 2024Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- ☆87Apr 15, 2022Updated 3 years ago
- ☆10Jan 9, 2025Updated last year
- IROS 2023 "VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"☆54Apr 22, 2024Updated last year
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆298Nov 29, 2022Updated 3 years ago
- [CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)☆139Aug 4, 2022Updated 3 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆32Dec 7, 2023Updated 2 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆358Jan 7, 2022Updated 4 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- ☆41Jun 3, 2022Updated 3 years ago
- awesome grounding: A curated list of research papers in visual grounding☆1,125Sep 21, 2025Updated 5 months ago
- Lightweight Transformer for Multi-modal Tasks☆16Dec 9, 2022Updated 3 years ago
- [CVPR2022] Official Implementation of ReferFormer☆352Feb 15, 2025Updated last year
- Referring Expression Parser☆27Feb 10, 2018Updated 8 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- ☆16Nov 14, 2022Updated 3 years ago
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 3 years ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Sep 4, 2022Updated 3 years ago
- A collection of papers about Referring Image Segmentation.☆808Jan 28, 2026Updated last month
- A curated list of research papers in Referring Expression Comprehension (REC)☆46May 13, 2021Updated 4 years ago
- This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch☆17Apr 7, 2020Updated 5 years ago
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping☆11Feb 7, 2025Updated last year
- A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]☆240Nov 14, 2025Updated 3 months ago
- A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)☆148Nov 18, 2020Updated 5 years ago
- ☆1,048Oct 3, 2022Updated 3 years ago
- Code for AAAI 2021 paper "SCNet: Traning Inference Sample Consistency for Instance Segmentation".☆22Jan 31, 2021Updated 5 years ago
- Referring Expression Datasets API☆558Aug 27, 2024Updated last year
- ☆57Jan 7, 2023Updated 3 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation☆10May 3, 2022Updated 3 years ago
- ☆14Nov 4, 2022Updated 3 years ago
- An official PyTorch implementation of the CRIS paper☆280Jun 9, 2024Updated last year
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Jan 18, 2023Updated 3 years ago