daqingliu/awesome-rec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/daqingliu/awesome-rec)

daqingliu / awesome-rec

A curated list of research papers in Referring Expression Comprehension (REC)

☆46

Alternatives and similar repositories for awesome-rec

Users that are interested in awesome-rec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
daqingliu / NMTree
View on GitHub
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38Nov 23, 2019Updated 6 years ago
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
aws / aws-refcocog-adv
View on GitHub
☆22Jan 14, 2026Updated 6 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
zyang-ur / onestage_grounding
View on GitHub
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆150Nov 18, 2020Updated 5 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
daqingliu / awesome-vln
View on GitHub
A curated list of research papers in Vision-Language Navigation (VLN)
☆238Apr 17, 2024Updated 2 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
wangpengnorman / KB-Ref_dataset
View on GitHub
☆16Dec 28, 2020Updated 5 years ago
lichengunc / speaker_listener_reinforcer
View on GitHub
Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension
☆34Mar 8, 2018Updated 8 years ago
zhjohnchan / SK-VG
View on GitHub
[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆34Jul 12, 2023Updated 3 years ago
luogen1996 / MCN
View on GitHub
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
☆139Aug 4, 2022Updated 3 years ago
youngfly11 / LCMCG-PyTorch
View on GitHub
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Oct 25, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chrisc36 / bottom-up-attention-vqa
View on GitHub
BottomUpTopDown VQA model with question-type debiasing
☆22Oct 6, 2019Updated 6 years ago
ubc-vision / RefTR
View on GitHub
Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆67May 26, 2022Updated 4 years ago
yrcong / NODIS
View on GitHub
Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020
☆12Aug 28, 2020Updated 5 years ago
mjhucla / Google_Refexp_toolbox
View on GitHub
The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283
☆166Mar 1, 2017Updated 9 years ago
iQua / M-DGT
View on GitHub
The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".
☆22Mar 26, 2022Updated 4 years ago
svip-lab / LBYLNet
View on GitHub
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆50Aug 31, 2021Updated 4 years ago
TheShadow29 / awesome-grounding
View on GitHub
awesome grounding: A curated list of research papers in visual grounding
☆1,126Sep 21, 2025Updated 10 months ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
mengcaopku / DCNet
View on GitHub
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Sep 4, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uqzhichen / Awesome-compositional-zero-shot-learning
View on GitHub
Paper list of compositional zero-shot learning
☆11Jul 5, 2022Updated 4 years ago
lichengunc / refer
View on GitHub
Referring Expression Datasets API
☆573Aug 27, 2024Updated last year
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
xh-liu / CM-Erase-REG
View on GitHub
Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"
☆34Jul 29, 2019Updated 6 years ago
MarkMoHR / Awesome-Referring-Image-Segmentation
View on GitHub
A collection of papers about Referring Image Segmentation.
☆826Jan 28, 2026Updated 5 months ago
daqingliu / CAVP
View on GitHub
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆46Jul 27, 2019Updated 6 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
TomVeniat / MNTDP
View on GitHub
Implementation of [MNTDP](https://arxiv.org/abs/2012.12631)
☆18Mar 9, 2022Updated 4 years ago
PluviophileYU / CVC-QA
View on GitHub
Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"
☆14Oct 13, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
allenai / reclip
View on GitHub
☆92Apr 15, 2022Updated 4 years ago
insomnia94 / ISREG
View on GitHub
iterative shrinking for referring expression grounding using deep reinforcement learning
☆14Nov 27, 2021Updated 4 years ago
ChenyunWu / PhraseCutDataset
View on GitHub
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
☆116Mar 28, 2026Updated 3 months ago
yuleiniu / vc
View on GitHub
Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"
☆30Jul 4, 2018Updated 8 years ago
luogen1996 / SimREC
View on GitHub
A lightweight codebase for referring expression comprehension and segmentation
☆57May 21, 2022Updated 4 years ago
sunnychencool / AOQ
View on GitHub
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34Jul 2, 2020Updated 6 years ago