SijieSong / CVPR21-Cogrounding_semantic_attentionView external linksLinks
☆14Jul 13, 2021Updated 4 years ago
Alternatives and similar repositories for CVPR21-Cogrounding_semantic_attention
Users that are interested in CVPR21-Cogrounding_semantic_attention are comparing it to the libraries listed below
Sorting:
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- [CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation☆19May 7, 2021Updated 4 years ago
- A curated list of research papers in Referring Expression Comprehension (REC)☆46May 13, 2021Updated 4 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆58Oct 25, 2021Updated 4 years ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆112Oct 15, 2021Updated 4 years ago
- 我的个人wiki、博客☆29Jan 24, 2026Updated 3 weeks ago
- A reading list of papers about Visual Grounding.☆32Aug 24, 2022Updated 3 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29May 31, 2023Updated 2 years ago
- Implementation of "Spectral Feature Tansformation for Person Re-identification"☆31Sep 7, 2019Updated 6 years ago
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- ☆10Jun 21, 2024Updated last year
- ☆12Apr 5, 2016Updated 9 years ago
- Repository to storage the 4mula dataset☆10Sep 1, 2021Updated 4 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆87Nov 22, 2020Updated 5 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- Video-aided Unsupervised Grammar Induction, NAACL‘21 [best long paper]☆40Oct 27, 2022Updated 3 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆11Aug 28, 2020Updated 5 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago
- Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)☆39Nov 23, 2019Updated 6 years ago
- Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, publ…☆16Jul 26, 2024Updated last year
- ☆12Aug 14, 2019Updated 6 years ago
- Load and visualize different datasets in video question answering☆10May 11, 2021Updated 4 years ago
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'☆10Mar 11, 2024Updated last year
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆49Aug 31, 2021Updated 4 years ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- AAAI 2020. Spatial-Temporal Synchronous Graph Convolutional Networks: A New Framework for Spatial-Temporal Network Data Forecasting☆12Dec 20, 2019Updated 6 years ago
- Paper list of compositional zero-shot learning☆11Jul 5, 2022Updated 3 years ago
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…☆12Dec 30, 2021Updated 4 years ago
- ☆11Jan 14, 2017Updated 9 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- PlanT 2.0: Exposing Biases and Structural Flaws in Closed-Loop Driving☆39Feb 3, 2026Updated 2 weeks ago
- Repository for codes in the paper "System-and Sample-agnostic Isotropic 3D Microscopy by Weakly Physics-informed, Domain-shift-resistant …☆16Aug 26, 2025Updated 5 months ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago