paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV))
☆38Jan 12, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-video-moment-retrieval
Users that are interested in awesome-video-moment-retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- source code of our MGPN in SIGIR 2022☆18Jun 8, 2022Updated 4 years ago
- A reading list of papers about Visual Grounding.☆31Aug 24, 2022Updated 3 years ago
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆264Aug 26, 2023Updated 2 years ago
- paper list on Video Moment Retrieval (VMR), or Temporal Video Grounding (TVG), Video Grounding (VG), or Temporal Sentence Grounding in Vi…☆41Dec 27, 2025Updated 6 months ago
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset☆346Mar 9, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29May 31, 2023Updated 3 years ago
- Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)☆58Aug 31, 2021Updated 4 years ago
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Aug 5, 2022Updated 3 years ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆40Jan 20, 2025Updated last year
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …☆250Aug 12, 2025Updated 10 months ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary☆17Jul 17, 2022Updated 3 years ago
- Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”☆10Apr 3, 2022Updated 4 years ago
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆29Jul 2, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆43Sep 23, 2021Updated 4 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- ☆10Aug 21, 2022Updated 3 years ago
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 3 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆68Jun 27, 2022Updated 4 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆57Nov 4, 2024Updated last year
- [TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"☆16May 12, 2025Updated last year
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆31Apr 4, 2022Updated 4 years ago
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆154Aug 21, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"☆132Jul 5, 2021Updated 4 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models☆148Aug 21, 2025Updated 10 months ago
- [ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding☆44Mar 18, 2025Updated last year
- ☆13Nov 28, 2021Updated 4 years ago
- [NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding☆54Mar 5, 2024Updated 2 years ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆113Oct 15, 2021Updated 4 years ago
- VIDIMU-TOOLS is a code repository related to the public dataset "VIDIMU. multimodal video and IMU kinematic dataset on daily life activit…☆11Jun 2, 2024Updated 2 years ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆91Nov 16, 2022Updated 3 years ago
- CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning☆65Mar 22, 2026Updated 3 months ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆95Mar 9, 2025Updated last year
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …☆240Apr 15, 2024Updated 2 years ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆100Jan 23, 2022Updated 4 years ago
- PIC API☆25Sep 18, 2019Updated 6 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆41Apr 11, 2025Updated last year