[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39 · Jul 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for LocVTP
Users who are interested in LocVTP are comparing it to the repositories listed below.
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding … ☆42 · Aug 5, 2022 · Updated 3 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman. ☆119 · Oct 9, 2023 · Updated 2 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding" ☆29 · May 31, 2023 · Updated 2 years ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding ☆91 · Nov 16, 2022 · Updated 3 years ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension ☆15 · Sep 4, 2022 · Updated 3 years ago
- An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI… ☆128 · Apr 1, 2023 · Updated 2 years ago
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding" ☆131 · Jul 5, 2021 · Updated 4 years ago
- TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021) ☆117 · Sep 16, 2023 · Updated 2 years ago
- Placeholder for code of BSP. ☆11 · Aug 13, 2021 · Updated 4 years ago
- A reading list of papers about Visual Grounding. ☆32 · Aug 24, 2022 · Updated 3 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022 ☆69 · Jun 27, 2022 · Updated 3 years ago
- [CVPR2022] Unsupervised Pre-training for Temporal Action Localization Tasks (UP-TAL) ☆29 · Mar 9, 2022 · Updated 3 years ago
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or … ☆236 · Apr 15, 2024 · Updated last year
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral). ☆48 · Mar 15, 2023 · Updated 2 years ago
- ☆12 · Mar 12, 2023 · Updated 2 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts ☆188 · May 1, 2025 · Updated 10 months ago
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning ☆15 · Dec 12, 2023 · Updated 2 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization ☆16 · Jul 20, 2023 · Updated 2 years ago
- VLG-Net: Video-Language Graph Matching Networks for Video Grounding ☆31 · May 31, 2022 · Updated 3 years ago
- A curated list of grounding natural language in video and related area. :-) ☆102 · Mar 31, 2022 · Updated 3 years ago
- ☆26 · Aug 4, 2020 · Updated 5 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral). ☆141 · Jul 20, 2022 · Updated 3 years ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021 ☆18 · Oct 24, 2021 · Updated 4 years ago
- CVPR2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency ☆18 · Aug 10, 2022 · Updated 3 years ago
- source code of our RaNet in EMNLP 2021 ☆30 · May 31, 2022 · Updated 3 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries ☆43 · Jul 20, 2020 · Updated 5 years ago
- [ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval ☆161 · May 28, 2024 · Updated last year
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding ☆64 · Mar 9, 2022 · Updated 3 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language" ☆17 · Aug 25, 2020 · Updated 5 years ago
- ☆80 · Nov 24, 2024 · Updated last year
- [ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval ☆79 · Nov 29, 2022 · Updated 3 years ago
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training" ☆117 · Jun 9, 2021 · Updated 4 years ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval ☆30 · Mar 4, 2022 · Updated 3 years ago
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023) ☆55 · Sep 7, 2023 · Updated 2 years ago
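- A continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, or video clip retrieval. ☆261 · Aug 26, 2023 · Updated 2 years ago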
- ☆16 · Dec 15, 2022 · Updated 3 years ago
- Dense Regression Network for Video Grounding (CVPR2020) ☆53 · Jan 28, 2021 · Updated 5 years ago
- Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan… ☆21 · Apr 7, 2021 · Updated 4 years ago
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". ☆294 · Jun 13, 2024 · Updated last year