TheShadow29/zsgnet-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TheShadow29/zsgnet-pytorch)

TheShadow29 / zsgnet-pytorch

Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.07129)

☆71

Alternatives and similar repositories for zsgnet-pytorch

Users that are interested in zsgnet-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
acambray / GroundeR-PyTorch
View on GitHub
This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch
☆18Apr 7, 2020Updated 6 years ago
zyang-ur / onestage_grounding
View on GitHub
A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)
☆150Nov 18, 2020Updated 5 years ago
ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sibeiyang / sgmn
View on GitHub
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
☆117Aug 10, 2020Updated 5 years ago
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
ChenyunWu / PhraseCutDataset
View on GitHub
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
☆116Mar 28, 2026Updated 3 months ago
GingL / ARN
View on GitHub
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
☆32Aug 29, 2019Updated 6 years ago
zfchenUnique / WSSTG
View on GitHub
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆57Jul 8, 2024Updated 2 years ago
zfchenUnique / Cops-Ref
View on GitHub
Accepted by CVPR 2020.
☆27Jul 11, 2024Updated 2 years ago
lichengunc / MAttNet
View on GitHub
MAttNet: Modular Attention Network for Referring Expression Comprehension
☆299Nov 29, 2022Updated 3 years ago
BryanPlummer / cite
View on GitHub
Implementation for our paper "Conditional Image-Text Embedding Networks"
☆39Mar 19, 2020Updated 6 years ago
youngfly11 / LCMCG-PyTorch
View on GitHub
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Oct 25, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BigRedT / info-ground
View on GitHub
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73Aug 22, 2020Updated 5 years ago
TheShadow29 / awesome-grounding
View on GitHub
awesome grounding: A curated list of research papers in visual grounding
☆1,126Sep 21, 2025Updated 10 months ago
daqingliu / NMTree
View on GitHub
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38Nov 23, 2019Updated 6 years ago
gsig / visual-grounding
View on GitHub
Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020
☆43Apr 26, 2020Updated 6 years ago
lichengunc / refer
View on GitHub
Referring Expression Datasets API
☆573Aug 27, 2024Updated last year
XiangChenchao / DDPN
View on GitHub
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
☆23Jun 27, 2018Updated 8 years ago
ruotianluo / refexp-comprehension
View on GitHub
Referring expression comprehension on ReferIt(RefClef)
☆10Nov 28, 2016Updated 9 years ago
facebookresearch / TextVQA
View on GitHub
Website for TextVQA dataset.
☆30Apr 30, 2023Updated 3 years ago
BigRedT / vico
View on GitHub
Multi-sense word embeddings from visual co-occurrences
☆25Sep 5, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
princetonvisualai / SPICE-U
View on GitHub
☆11Sep 7, 2020Updated 5 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
yiyang92 / vae_captioning
View on GitHub
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
☆60Apr 5, 2018Updated 8 years ago
ExplorerFreda / VGNSL
View on GitHub
[ACL 2019] Visually Grounded Neural Syntax Acquisition
☆90Feb 24, 2024Updated 2 years ago
TheShadow29 / visual-commonsense-pytorch
View on GitHub
For visual commonsense model
☆34Apr 12, 2019Updated 7 years ago
Guaranteer / VidSTG-Dataset
View on GitHub
This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc…
☆70May 1, 2020Updated 6 years ago
ikuinen / CMIN_moment_retrieval
View on GitHub
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87Nov 22, 2020Updated 5 years ago
fenglinliu98 / MIA
View on GitHub
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆65Oct 19, 2020Updated 5 years ago
kanchen-usc / KAC-Net
View on GitHub
Implementation of Knowledge Aided Consistency for Weakly Supervised Phrase Grounding in Tensorflow
☆95Mar 29, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
li-xirong / video-retrieval
View on GitHub
Deep Learning for Video Retrieval by Natural Language
☆11Oct 20, 2019Updated 6 years ago
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
luogen1996 / MCN
View on GitHub
[CVPR2020] Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation, CVPR2020 (oral)
☆139Aug 4, 2022Updated 3 years ago
niansong1996 / wassp
View on GitHub
Official code for AAAI'20 paper "Merging Weak and Active Supervision for Semantic Parsing"
☆11Dec 8, 2022Updated 3 years ago
jackroos / VL-BERT
View on GitHub
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
☆742May 22, 2023Updated 3 years ago
YuanEZhou / Grounded-Image-Captioning
View on GitHub
☆64Jan 5, 2022Updated 4 years ago
runzhouge / MAC
View on GitHub
MAC: Mining Activity Concepts for Language-based Temporal Localization
☆36Nov 26, 2018Updated 7 years ago