TheShadow29/awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

☆1,126

Alternatives and similar repositories for awesome-grounding

Users who are interested in awesome-grounding are comparing it to the repositories listed below.

djiajunustc / TransVG
☆198 · Feb 27, 2024 · Updated 2 years ago
lichengunc / refer
Referring Expression Datasets API
☆573 · Aug 27, 2024 · Updated last year
zyang-ur / ReSC
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90 · Sep 30, 2021 · Updated 4 years ago
zyang-ur / onestage_grounding
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆150 · Nov 18, 2020 · Updated 5 years ago
ashkamath / mdetr
☆1,051 · Oct 3, 2022 · Updated 3 years ago
iworldtong / Awesome-Temporal-Sentence-Grounding-in-Videos
A curated list of grounding natural language in video and related area. :-)
☆82 · Dec 16, 2019 · Updated 6 years ago
TheShadow29 / zsgnet-pytorch
Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…
☆71 · Apr 22, 2020 · Updated 6 years ago
yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97 · Dec 2, 2022 · Updated 3 years ago
lichengunc / MAttNet
MAttNet: Modular Attention Network for Referring Expression Comprehension
☆299 · Nov 29, 2022 · Updated 3 years ago
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
MarkMoHR / Awesome-Referring-Image-Segmentation
A collection of papers about Referring Image Segmentation.
☆826 · Jan 28, 2026 · Updated 5 months ago
sibeiyang / sgmn
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
☆117 · Aug 10, 2020 · Updated 5 years ago
yytzsy / SCDM
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71 · Sep 7, 2021 · Updated 4 years ago
SCZwangxiao / Temporal-Language-Grounding-in-videos
Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
☆100 · Jan 23, 2022 · Updated 4 years ago
WuJie1010 / Awesome-Temporally-Language-Grounding
A curated list of “Temporally Language Grounding” and related area
☆110 · Nov 28, 2019 · Updated 6 years ago
microsoft / GLIP
Grounded Language-Image Pre-training
☆2,605 · Jan 24, 2024 · Updated 2 years ago
jokieleung / awesome-visual-question-answering
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Common…
☆672 · Jul 6, 2023 · Updated 3 years ago
BigRedT / info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73 · Aug 22, 2020 · Updated 5 years ago
ubc-vision / RefTR
Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" NeurIPS 2021
☆67 · May 26, 2022 · Updated 4 years ago
LeapLabTHU / Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153 · Jul 13, 2024 · Updated 2 years ago
TheShadow29 / vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69 · Jun 10, 2020 · Updated 6 years ago
youngfly11 / ReIR-WeaklyGrounding.pytorch
The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021
☆28 · Oct 9, 2021 · Updated 4 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆45 · Nov 25, 2020 · Updated 5 years ago
GingL / ARN
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
☆32 · Aug 29, 2019 · Updated 6 years ago
youngfly11 / LCMCG-PyTorch
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58 · Oct 25, 2021 · Updated 4 years ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87 · Nov 22, 2020 · Updated 5 years ago
pliang279 / awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
☆6,911 · Aug 20, 2024 · Updated last year
svip-lab / LBYLNet
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆50 · Aug 31, 2021 · Updated 4 years ago
nku-shengzheliu / Pytorch-TransVG
An unofficial PyTorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".
☆50 · Jun 7, 2021 · Updated 5 years ago
facebookresearch / grounded-video-description
Video Grounding and Captioning
☆331 · Oct 12, 2021 · Updated 4 years ago
daqingliu / awesome-rec
A curated list of research papers in Referring Expression Comprehension (REC)
☆46 · May 13, 2021 · Updated 5 years ago
JonghwanMun / LGI4temporalgrounding
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆132 · Jul 5, 2021 · Updated 5 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43 · Jul 20, 2020 · Updated 6 years ago
jianzongwu / Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
☆998 · May 12, 2026 · Updated 2 months ago
vacancy / SceneGraphParser
A Python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
☆595 · Jan 23, 2024 · Updated 2 years ago
josiahwang / phraseloceval
Phrase Localization Evaluation Toolkit
☆20 · Aug 16, 2019 · Updated 6 years ago
seanzhuh / SeqTR
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144 · Oct 30, 2024 · Updated last year
facebookresearch / ActivityNet-Entities
A Dataset for Grounded Video Description
☆165 · Jan 4, 2022 · Updated 4 years ago
jiyanggao / TALL
TALL: Temporal Activity Localization via Language Query
☆220 · Mar 15, 2018 · Updated 8 years ago