ubc-vision/RefTR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ubc-vision/RefTR)

ubc-vision / RefTR

Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021

☆67

Alternatives and similar repositories for RefTR

Users that are interested in RefTR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yangli18 / VLTVG
View on GitHub
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
fengguang94 / CEFNet
View on GitHub
Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation, CVPR2021
☆21Aug 17, 2021Updated 4 years ago
yz93 / LAVT-RIS
View on GitHub
☆234Apr 13, 2023Updated 3 years ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
LukeForeverYoung / QRNet
View on GitHub
☆41Jun 3, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
luyh20 / VL-Grasp
View on GitHub
IROS 2023 "VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"
☆61Apr 22, 2024Updated 2 years ago
seanzhuh / SeqTR
View on GitHub
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Oct 30, 2024Updated last year
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆91Sep 30, 2021Updated 4 years ago
QiuHeqian / mmdetection-ref
View on GitHub
☆10Jan 9, 2025Updated last year
luogen1996 / MCN
View on GitHub
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
☆139Aug 4, 2022Updated 3 years ago
TheShadow29 / awesome-grounding
View on GitHub
awesome grounding: A curated list of research papers in visual grounding
☆1,127Sep 21, 2025Updated 10 months ago
henghuiding / Vision-Language-Transformer
View on GitHub
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
☆366Jan 7, 2022Updated 4 years ago
daqingliu / awesome-rec
View on GitHub
A curated list of research papers in Referring Expression Comprehension (REC)
☆47May 13, 2021Updated 5 years ago
henghuiding / gRefCOCO
View on GitHub
A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]
☆241Nov 14, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
luogen1996 / LWTransformer
View on GitHub
Lightweight Transformer for Multi-modal Tasks
☆16Dec 9, 2022Updated 3 years ago
MarkMoHR / Awesome-Referring-Image-Segmentation
View on GitHub
A collection of papers about Referring Image Segmentation.
☆826Jan 28, 2026Updated 5 months ago
lichengunc / refer-parser2
View on GitHub
Referring Expression Parser
☆27Feb 10, 2018Updated 8 years ago
wjn922 / ReferFormer
View on GitHub
[CVPR2022] Official Implementation of ReferFormer
☆355Feb 15, 2025Updated last year
zhjohnchan / SK-VG
View on GitHub
[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆34Jul 12, 2023Updated 3 years ago
mengcaopku / DCNet
View on GitHub
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Sep 4, 2022Updated 3 years ago
svip-lab / LBYLNet
View on GitHub
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆51Aug 31, 2021Updated 4 years ago
qumengxue / RIO
View on GitHub
☆13Oct 30, 2023Updated 2 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
DafaRen / Learning_Bifunctional_Push-grasping_Synergistic_Strategy_for_Goal-agnostic_and_Goal-oriented_Tasks
View on GitHub
☆14Nov 4, 2022Updated 3 years ago
hjy-u / ETOG
View on GitHub
[ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
☆13Feb 7, 2025Updated last year
IDEA-Research / DQ-DETR
View on GitHub
[AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
☆58Nov 28, 2022Updated 3 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
chihhuiho / yoro
View on GitHub
☆16Nov 14, 2022Updated 3 years ago
ashkamath / mdetr
View on GitHub
☆1,050Oct 3, 2022Updated 3 years ago
DerrickWang005 / CRIS.pytorch
View on GitHub
An official PyTorch implementation of the CRIS paper
☆281Jun 9, 2024Updated 2 years ago
toggle1995 / RIS-DMMI
View on GitHub
☆47Oct 3, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
thunlp / PEVL
View on GitHub
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
☆49Nov 10, 2022Updated 3 years ago
Mxbonn / ltmp
View on GitHub
Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…
☆17Nov 24, 2024Updated last year
acambray / GroundeR-PyTorch
View on GitHub
This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch
☆18Apr 7, 2020Updated 6 years ago
eraserNut / MedRPG
View on GitHub
☆22Jun 20, 2024Updated 2 years ago
insomnia94 / ISREG
View on GitHub
iterative shrinking for referring expression grounding using deep reinforcement learning
☆14Nov 27, 2021Updated 4 years ago
ChenyunWu / PhraseCutDataset
View on GitHub
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
☆116Mar 28, 2026Updated 3 months ago
facebookresearch / CiT
View on GitHub
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Jan 18, 2023Updated 3 years ago