26hzhang / VSLNet
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆100Updated 2 years ago
Related projects: ⓘ
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"☆129Updated 3 years ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆93Updated 2 years ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆51Updated 3 years ago
- ☆34Updated 3 years ago
- This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc…☆54Updated 4 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022.☆63Updated 2 years ago
- Dense Regression Network for Video Grounding (CVPR2020)☆50Updated 3 years ago
- Repository of proposal-free temporal moment localization work☆33Updated 3 months ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Updated 2 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization☆33Updated 4 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆45Updated last year
- VLG-Net: Video-Language Graph Matching Networks for Video Grounding☆30Updated 2 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29Updated last year
- ☆29Updated 2 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆42Updated 4 years ago
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆86Updated 3 years ago
- A curated list of grounding natural language in video and related area. :-)☆88Updated 2 years ago
- ☆16Updated 2 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆45Updated last year
- ☆25Updated 4 years ago
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆95Updated 3 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"☆30Updated 5 years ago
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆68Updated 3 years ago
- The HC-STVG Dataset☆53Updated last year
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆68Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆89Updated last year
- [ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation☆57Updated 2 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)☆67Updated 4 years ago
- A reading list of papers about Visual Grounding.☆31Updated 2 years ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆56Updated 5 months ago