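A continuously updated list of cutting-edge papers on video moment localization, temporal language grounding, and video moment retrieval.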
☆14Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-Cross-Modal-Video-Moment-Retrieval
Users that are interested in Awesome-Cross-Modal-Video-Moment-Retrieval are comparing it to the libraries listed below.
- ACM MULTIMEDIA CONFERENCE 2020☆11Jul 28, 2020Updated 5 years ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval☆31Mar 4, 2022Updated 4 years ago
- Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval☆100Jan 23, 2022Updated 4 years ago
- ☆15Aug 28, 2024Updated last year
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆118Jun 9, 2021Updated 5 years ago
- Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization☆16Jul 20, 2023Updated 2 years ago
- Dense Regression Network for Video Grounding (CVPR2020)☆53Jan 28, 2021Updated 5 years ago
- ☆19May 14, 2025Updated last year
- Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …☆42Aug 5, 2022Updated 3 years ago
- "Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022☆68Jun 27, 2022Updated 3 years ago
- ☆19Jul 28, 2025Updated 10 months ago
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆29Jul 2, 2025Updated 11 months ago
- Line-by-line Chinese notes on the source code of some classic papers☆600Oct 19, 2022Updated 3 years ago
- A reading list of papers about Visual Grounding.☆31Aug 24, 2022Updated 3 years ago
- [ICME2025 Oral] Official PyTorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆11Mar 21, 2025Updated last year
- Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos☆87Nov 22, 2020Updated 5 years ago
- The top conferences on video retrieval libraries in recent years, synchronized with my blog.☆14Nov 27, 2021Updated 4 years ago
- [IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks☆11Sep 2, 2024Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆48Mar 15, 2023Updated 3 years ago
- Multiple Attractors simulation with customization☆14Feb 22, 2026Updated 3 months ago
- This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 9 months ago
- VideoX: a collection of video cross-modal models☆1,066Jun 3, 2024Updated 2 years ago
- Official Repo for CVPR 2025 Paper -- DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos☆17Mar 16, 2026Updated 3 months ago
- ☆13Jan 5, 2022Updated 4 years ago
- ☆17Jun 15, 2022Updated 4 years ago
- implementation of "Action Quality Assessment with Temporal Parsing Transformer"☆25Aug 2, 2022Updated 3 years ago
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆15Dec 12, 2023Updated 2 years ago
- ☆14Oct 30, 2023Updated 2 years ago
- The code for CVPR2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".☆29May 14, 2023Updated 3 years ago
- [IEEE TGRS'23] Location-aware Adaptive Normalization: A Deep Learning Approach for Wildfire Danger Forecasting☆18Apr 7, 2025Updated last year
- VLG-Net: Video-Language Graph Matching Networks for Video Grounding☆31May 31, 2022Updated 4 years ago
- Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”☆10Apr 3, 2022Updated 4 years ago
- ☆24Oct 14, 2024Updated last year
- A ComfyUI extension for StyleShot.☆16Apr 23, 2025Updated last year
- source code for ICCV2021 paper "MFNet: Multi-filter Directive Network for Weakly Supervised Salient Object Detection"☆11Jul 17, 2022Updated 3 years ago
- ☆16Dec 25, 2021Updated 4 years ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)☆108Jan 23, 2025Updated last year
- A PyTorch implementation of Wide & Deep, validated on the Avazu dataset☆10Feb 4, 2021Updated 5 years ago