A PyTorch implementation of "Grounding of Textual Phrases in Images by Reconstruction".
☆17 · Apr 7, 2020 · Updated 5 years ago
Alternatives and similar repositories for GroundeR-PyTorch
Users who are interested in GroundeR-PyTorch are comparing it to the libraries listed below.
- The substitution of qsub. ☆12 · Jan 25, 2019 · Updated 7 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information ☆74 · Aug 22, 2020 · Updated 5 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics" ☆22 · Dec 27, 2022 · Updated 3 years ago
- This is the repo for Multi-level textual grounding ☆34 · Jul 21, 2020 · Updated 5 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆33 · Aug 29, 2019 · Updated 6 years ago
- This repository contains the main baselines introduced in WSSTG (ACL 2019). ☆56 · Jul 8, 2024 · Updated last year
- 'Bi-directional Relationship Inferring Network for Referring Image Segmentation' CVPR2020 ☆18 · Apr 2, 2022 · Updated 3 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding ☆23 · Jun 27, 2018 · Updated 7 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071… ☆71 · Apr 22, 2020 · Updated 5 years ago
- TODO ;) ☆12 · Aug 13, 2018 · Updated 7 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆89 · Sep 30, 2021 · Updated 4 years ago
- Query Learning of Both Thing and Stuff for Panoptic Segmentation-ICIP-2022 ☆15 · Sep 3, 2022 · Updated 3 years ago
- This repository provides the dataset introduced by our WSSTG paper ☆13 · Jul 21, 2019 · Updated 6 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding" ☆58 · Oct 25, 2021 · Updated 4 years ago
- Official Code for ECCV2022: Learning Semantic Correspondence with Sparse Annotations ☆18 · Aug 22, 2022 · Updated 3 years ago
- MAttNet: Modular Attention Network for Referring Expression Comprehension ☆297 · Nov 29, 2022 · Updated 3 years ago
- Scene Graph Parsing as Dependency Parsing ☆41 · May 22, 2019 · Updated 6 years ago
- ☆17 · Oct 20, 2020 · Updated 5 years ago
- ☆22 · Jan 14, 2026 · Updated last month
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC" ☆37 · Nov 26, 2025 · Updated 3 months ago
- The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding". ☆22 · Mar 26, 2022 · Updated 3 years ago
- Accepted by AAAI2022 ☆21 · Apr 10, 2022 · Updated 3 years ago
- Inferring and Executing Programs for Visual Reasoning ☆21 · Jan 4, 2019 · Updated 7 years ago
- Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P… ☆59 · Mar 24, 2023 · Updated 2 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019 ☆92 · Aug 9, 2019 · Updated 6 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021 ☆27 · Oct 9, 2021 · Updated 4 years ago
- Referring Expression Parser ☆27 · Feb 10, 2018 · Updated 8 years ago
- ☆27 · Jun 11, 2022 · Updated 3 years ago
- Visual Grounding of Referring Expressions for Human-Robot Interaction ☆26 · Nov 16, 2018 · Updated 7 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021 ☆68 · May 26, 2022 · Updated 3 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data". ☆78 · Jan 18, 2023 · Updated 3 years ago
- Accepted by CVPR 2020. ☆27 · Jul 11, 2024 · Updated last year
- [arXiv 2020] Video Representation Learning with Visual Tempo Consistency ☆24 · Jun 30, 2020 · Updated 5 years ago
- Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral. ☆116 · Aug 10, 2020 · Updated 5 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606) ☆69 · Jun 10, 2020 · Updated 5 years ago
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025) ☆14 · Nov 20, 2025 · Updated 3 months ago
- SurgLaVi: Large-Scale Hierarchical Datasets for Surgical Vision–Language Representation Learning ☆23 · Feb 2, 2026 · Updated 3 weeks ago
- ☆35 · May 2, 2022 · Updated 3 years ago
- Code for "Mining the Benefits of Two-stage and One-stage HOI Detection" ☆90 · Mar 31, 2024 · Updated last year