Phrase Localization Evaluation Toolkit
☆20Aug 16, 2019Updated 6 years ago
Alternatives and similar repositories for phraseloceval
Users that are interested in phraseloceval are comparing it to the libraries listed below
Sorting:
- Gender/Age attribute grounding using weak supervised manner.☆12Jun 23, 2019Updated 6 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆27Oct 9, 2021Updated 4 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- Central repository for all public AIDA resources☆13Mar 1, 2021Updated 5 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- Weakly Supervised Grounding for VQA in Vision-Language Transformers☆16May 6, 2023Updated 2 years ago
- Implementation of Soft-Label Chain Conditional Random Field for Phrase Grounding in PyTorch☆16Oct 21, 2022Updated 3 years ago
- An unofficial pytorch implementation of "TransVG: End-to-End Visual Grounding with Transformers".☆52Jun 7, 2021Updated 4 years ago
- Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding☆23Jun 27, 2018Updated 7 years ago
- A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)☆148Nov 18, 2020Updated 5 years ago
- Implementation for "Joint Event Detection and Description in Continuous Video Streams"☆23Nov 4, 2020Updated 5 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- Learning Compatible Embeddings, ICCV 2021☆33Aug 18, 2021Updated 4 years ago
- Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019☆31Apr 21, 2021Updated 4 years ago
- awesome grounding: A curated list of research papers in visual grounding☆1,125Sep 21, 2025Updated 5 months ago
- Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.☆116Aug 10, 2020Updated 5 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…☆71Apr 22, 2020Updated 5 years ago
- Code for Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks☆30Feb 8, 2018Updated 8 years ago
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆34Mar 1, 2023Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Dec 5, 2022Updated 3 years ago
- Dynamic mode decomposition in Python☆13Jun 9, 2015Updated 10 years ago
- Implementation of Background Substraction using Gaussian mixture model and using OpenCV library.☆11Feb 7, 2018Updated 8 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- Codebase for EA Modeling (for Transactions on Affective Computing paper)☆12Dec 8, 2022Updated 3 years ago
- Implementation of "Make One-Shot Video Object Segmentation Efficient Again” and the semi-supervised fine-tuning "e-OSVOS" approach (NeurI…☆36Mar 24, 2021Updated 4 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Dec 2, 2022Updated 3 years ago
- Official implementation of Lightweight Human Pose Estimation Using Loss Weighted by Target Heatmap that was honorably mentioned as Best P…☆11Dec 17, 2023Updated 2 years ago
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- Port of Chromaprint C/C++ library to Ruby to extract fingerprints from audio sources.☆12Nov 7, 2013Updated 12 years ago
- TIMES Demo Model☆15Jan 18, 2024Updated 2 years ago
- paper code commit-fsmafl☆10Mar 18, 2024Updated last year