This is the repo for Multi-level textual grounding
☆34 · Jul 21, 2020 · Updated 5 years ago
Alternatives and similar repositories for MultiGrounding
Users that are interested in MultiGrounding are comparing it to the libraries listed below
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information ☆74 · Aug 22, 2020 · Updated 5 years ago
- This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch ☆17 · Apr 7, 2020 · Updated 5 years ago
- Implementation for MAF: Multimodal Alignment Framework ☆46 · Nov 25, 2020 · Updated 5 years ago
- ☆64 · Jan 5, 2022 · Updated 4 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020 ☆89 · Sep 30, 2021 · Updated 4 years ago
- Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering" ☆14 · Oct 13, 2020 · Updated 5 years ago
- Referring Expression Parser ☆27 · Feb 10, 2018 · Updated 8 years ago
- Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020 ☆43 · Apr 26, 2020 · Updated 5 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019 ☆92 · Aug 9, 2019 · Updated 6 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics" ☆22 · Dec 27, 2022 · Updated 3 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight) ☆57 · Dec 8, 2022 · Updated 3 years ago
- Inferring and Executing Programs for Visual Reasoning ☆21 · Jan 4, 2019 · Updated 7 years ago
- Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017 ☆67 · Sep 20, 2018 · Updated 7 years ago
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020) ☆13 · Jul 25, 2024 · Updated last year
- ☆11 · Jan 24, 2021 · Updated 5 years ago
- TODO ;) ☆12 · Aug 13, 2018 · Updated 7 years ago
- Repository for hosting the code for the CVPR 2020 paper Differentiable Adaptive Computation Time for Visual Reasoning. ☆14 · Aug 26, 2020 · Updated 5 years ago
- iterative shrinking for referring expression grounding using deep reinforcement learning ☆14 · Nov 27, 2021 · Updated 4 years ago
- ☆15 · Nov 23, 2020 · Updated 5 years ago
- Preliminary code for reviewers ☆13 · Mar 30, 2021 · Updated 4 years ago
- PyTorch bottom-up attention with Detectron2 ☆239 · Jan 4, 2022 · Updated 4 years ago
- ☆16 · Dec 28, 2020 · Updated 5 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆33 · Aug 29, 2019 · Updated 6 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing" ☆34 · Jul 29, 2019 · Updated 6 years ago
- Scene Graph Parsing as Dependency Parsing ☆41 · May 22, 2019 · Updated 6 years ago
- ☆478 · Nov 21, 2022 · Updated 3 years ago
- SelfCriticalSequenceTrainingforImageCaptioning ☆21 · May 27, 2017 · Updated 8 years ago
- Generic Object ZSL Dataset (GOZ) ☆17 · Oct 19, 2022 · Updated 3 years ago
- implement n2nmn with pytorch ☆19 · Apr 10, 2019 · Updated 6 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606) ☆69 · Jun 10, 2020 · Updated 5 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations ☆42 · Jun 30, 2021 · Updated 4 years ago
- ☆22 · Jan 14, 2026 · Updated last month
- ☆17 · Oct 20, 2020 · Updated 5 years ago
- The code and output of our AAAI paper "Knowledge-Enriched Visual Storytelling" ☆41 · May 3, 2021 · Updated 4 years ago
- Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch ☆16 · Oct 21, 2022 · Updated 3 years ago
- Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral. ☆116 · Aug 10, 2020 · Updated 5 years ago
- Word sense disambiguation using contextualized word embedding ☆17 · Dec 18, 2019 · Updated 6 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment ☆22 · Apr 15, 2022 · Updated 3 years ago
- Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019) ☆134 · Mar 15, 2024 · Updated last year