nakaizura/Awesome-Cross-Modal-Video-Moment-Retrieval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nakaizura/Awesome-Cross-Modal-Video-Moment-Retrieval)

nakaizura / Awesome-Cross-Modal-Video-Moment-Retrieval

前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。

☆14

Alternatives and similar repositories for Awesome-Cross-Modal-Video-Moment-Retrieval

Users that are interested in Awesome-Cross-Modal-Video-Moment-Retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yawenzeng / STRONG
View on GitHub
ACM MULTIMEDIA CONFERENCE 2020
☆11Jul 28, 2020Updated 6 years ago
Huntersxsx / TSGV-Learning-List
View on GitHub
Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作
☆31Mar 4, 2022Updated 4 years ago
yawenzeng / Awesome-Cross-Modal-Video-Moment-Retrieval
View on GitHub
前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。
☆266Aug 26, 2023Updated 2 years ago
SCZwangxiao / Temporal-Language-Grounding-in-videos
View on GitHub
Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
☆100Jan 23, 2022Updated 4 years ago
linjieli222 / HERO_Video_Feature_Extractor
View on GitHub
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆118Jun 9, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
minghangz / SPL
View on GitHub
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
☆16Jul 20, 2023Updated 3 years ago
pavanravva / Enhanced-MMASD
View on GitHub
☆19May 14, 2025Updated last year
Lilidamowang / T2VIndexer-generativeSearch
View on GitHub
☆16Aug 28, 2024Updated last year
sangminwoo / Explore-And-Match
View on GitHub
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …
☆42Aug 5, 2022Updated 3 years ago
Alvin-Zeng / DRN
View on GitHub
Dense Regression Network for Video Grounding (CVPR2020)
☆53Jan 28, 2021Updated 5 years ago
iworldtong / TALL.pytorch
View on GitHub
PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."
☆14Apr 20, 2019Updated 7 years ago
r-cui / ViGA
View on GitHub
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022
☆68Jun 27, 2022Updated 4 years ago
ZQSIAT / OV-OAD
View on GitHub
This repo takes the initial step towards leveraging text learning for online action detection without explicit human supervision.
☆15Jul 13, 2026Updated 2 weeks ago
26hzhang / VSLNet
View on GitHub
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆113Oct 15, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Lzq5 / Video-Text-Alignment
View on GitHub
☆28Jul 18, 2025Updated last year
icq-benchmark / icq-benchmark
View on GitHub
☆19Jul 28, 2025Updated last year
HengLan / TA-STVG
View on GitHub
[ICLR 2025] Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
☆44Mar 18, 2025Updated last year
yeliudev / nncore
View on GitHub
📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.
☆29Jul 9, 2026Updated 3 weeks ago
zhengxuJosh / 360SFUDA
View on GitHub
Code for Panoramic Semantic Segmentation
☆16Apr 26, 2024Updated 2 years ago
Milittle / GaitSystem
View on GitHub
Project GaitSystem is a Gait recognition system based Windows Visual Studio 2017. The algorithm based the paper Gait optical flow image d…
☆12Jun 3, 2019Updated 7 years ago
NeverMoreLCH / Awesome-Video-Grounding
View on GitHub
A reading list of papers about Visual Grounding.
☆31Aug 24, 2022Updated 3 years ago
GONGJIA0208 / Diffpose_video
View on GitHub
The code of DIffpose video setting
☆16Dec 28, 2023Updated 2 years ago
May2333 / FDCA
View on GitHub
[ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…
☆23Jul 28, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
yellowtownhz / 3DLocalCNN
View on GitHub
☆10Jan 4, 2022Updated 4 years ago
zs1314 / Fraesormer
View on GitHub
【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13Mar 21, 2025Updated last year
JonghwanMun / LGI4temporalgrounding
View on GitHub
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆132Jul 5, 2021Updated 5 years ago
zihaomu / resgait
View on GitHub
The benchmark experiments of paper "ReSGait: The real scene gait dataset".
☆12Jul 25, 2024Updated 2 years ago
ikuinen / CMIN_moment_retrieval
View on GitHub
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87Nov 22, 2020Updated 5 years ago
liuxiaolei88 / Awesome-Text2Video-Retrieval
View on GitHub
The top conferences on video retrieval libraries in recent years, synchronized with my blog.
☆14Nov 27, 2021Updated 4 years ago
supersupercong / MSGNN
View on GitHub
[IJCAI-24] Explore Internal and External Similarity for Single Image Deraining with Graph Neural Networks
☆11Sep 2, 2024Updated last year
Kirili4ik / kws-attention-pytorch
View on GitHub
Keyword spotting for audio with attention (KWS model for audio)
☆18Jul 15, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
alon-albalak / online-data-mixing
View on GitHub
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14Jan 9, 2024Updated 2 years ago
gistvision / PSVL
View on GitHub
Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).
☆48Mar 15, 2023Updated 3 years ago
jobinregina / Chaos
View on GitHub
Using machine learning techniques for prediction and modelling non linear dynamic systems.
☆10Jun 29, 2018Updated 8 years ago
microsoft / VideoX
View on GitHub
VideoX: a collection of video cross-modal models
☆1,071Jun 3, 2024Updated 2 years ago
AIM3-RUC / Youmakeup_Challenge2022
View on GitHub
☆17Jun 15, 2022Updated 4 years ago
MathLee / MatlabEvaluationTools
View on GitHub
The evaluation tool (Matlab version) for saliency maps.
☆10Mar 18, 2022Updated 4 years ago
wuchangming / react-interpreter
View on GitHub
React 沙盒 📦，可理解为 React 版的 eval() 。该沙盒运行机制可使基于 React 实现的小程序框架「如 Taro3 等」拥有 🚀 热更新能力。
☆15Mar 7, 2022Updated 4 years ago