The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".
☆22Mar 26, 2022Updated 3 years ago
Alternatives and similar repositories for M-DGT
Users that are interested in M-DGT are comparing it to the libraries listed below
Sorting:
- Preliminary code for reviewers☆13Mar 30, 2021Updated 4 years ago
- ☆195Feb 27, 2024Updated 2 years ago
- ☆16Nov 14, 2022Updated 3 years ago
- This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch☆17Apr 7, 2020Updated 5 years ago
- ☆41Jun 3, 2022Updated 3 years ago
- ☆22Jan 14, 2026Updated last month
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- ☆20Apr 2, 2024Updated last year
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Dec 2, 2022Updated 3 years ago
- An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".☆52Jun 7, 2021Updated 4 years ago
- 'Bi-directional Relationship Inferring Network for Referring Image Segmentation' CVPR2020☆18Apr 2, 2022Updated 3 years ago
- ☆10Nov 22, 2022Updated 3 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆58Oct 25, 2021Updated 4 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆27Oct 9, 2021Updated 4 years ago
- ☆14Sep 20, 2025Updated 5 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆132Nov 10, 2025Updated 3 months ago
- Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension☆34Mar 8, 2018Updated 7 years ago
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025)☆14Nov 20, 2025Updated 3 months ago
- Repository of proposal-free temporal moment localization work☆33Jun 11, 2024Updated last year
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆145Oct 30, 2024Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- ☆10Jun 21, 2024Updated last year
- ☆10Jun 8, 2024Updated last year
- The code will come soon.☆15Sep 12, 2025Updated 5 months ago
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- ☆10Dec 11, 2021Updated 4 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- Python code to break SVG files into polygon objects consumable in Tableau.☆10Mar 16, 2020Updated 5 years ago
- ☆12Jun 27, 2022Updated 3 years ago
- ☆16Jun 14, 2024Updated last year
- This is the pytorch implementation for the paper "Delta-encoder: an effective sample synthesis method for few-shot object recognition" ht…☆11Jan 10, 2020Updated 6 years ago
- ☆11Dec 13, 2023Updated 2 years ago
- a Unified Model for Scene Search and Synthesis from Sketch☆11Aug 18, 2021Updated 4 years ago
- implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering☆10Mar 17, 2022Updated 3 years ago
- This repository is the official implementation of our paper Robust Diffusion Model-Generated Image Detection with CLIP, accepted by MIPR …☆10Jun 13, 2024Updated last year
- GraphOfDocs: Representing multiple documents as a single graph☆21Jun 22, 2022Updated 3 years ago
- Agent based-model of the banking system (NetLogo)☆11Apr 13, 2018Updated 7 years ago