cshizhe/hgr_v2t

Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".

☆211

Alternatives and similar repositories for hgr_v2t

Users who are interested in hgr_v2t are comparing it to the repositories listed below.

niluthpol / multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68 · Apr 10, 2020 · Updated 6 years ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87 · Nov 22, 2020 · Updated 5 years ago
gabeur / mmt
Multi-Modal Transformer for Video Retrieval
☆265 · Oct 9, 2024 · Updated last year
simon-ging / coot-videotext
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291 · Sep 6, 2022 · Updated 3 years ago
albanie / collaborative-experts
Video embeddings for retrieval with natural language queries
☆344 · Feb 15, 2023 · Updated 3 years ago
JonghwanMun / LGI4temporalgrounding
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆132 · Jul 5, 2021 · Updated 5 years ago
danieljf24 / dual_encoding
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
☆153 · Jan 10, 2023 · Updated 3 years ago
danieljf24 / awesome-video-text-retrieval
A curated list of deep learning resources for video-text retrieval.
☆644 · Oct 20, 2023 · Updated 2 years ago
CrossmodalGroup / GSMN
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
☆170 · Oct 12, 2020 · Updated 5 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43 · Jul 20, 2020 · Updated 6 years ago
liudaizong / CSMGAN
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34 · Sep 3, 2020 · Updated 5 years ago
mwray / Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
☆53 · Oct 18, 2022 · Updated 3 years ago
yytzsy / SCDM
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71 · Sep 7, 2021 · Updated 4 years ago
KunpengLi1994 / VSRN
PyTorch code for ICCV'19 paper "Visual Semantic Reasoning for Image-Text Matching"
☆304 · Jan 14, 2020 · Updated 6 years ago
antoine77340 / howto100m
Code for the HowTo100M paper
☆304 · Mar 10, 2020 · Updated 6 years ago
jayleicn / TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆163 · May 28, 2024 · Updated 2 years ago
cshizhe / asg2cap
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200 · Dec 1, 2022 · Updated 3 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 · Dec 4, 2020 · Updated 5 years ago
eric-xw / kinetics-i3d-pytorch
☆35 · Mar 22, 2019 · Updated 7 years ago
antoine77340 / MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221 · Jul 5, 2022 · Updated 4 years ago
antoine77340 / Mixture-of-Embedding-Experts
Mixture-of-Embeddings-Experts
☆122 · Jul 21, 2020 · Updated 6 years ago
youngfly11 / LCMCG-PyTorch
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58 · Oct 25, 2021 · Updated 4 years ago
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 · Nov 23, 2020 · Updated 5 years ago
danieljf24 / hybrid_space
Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…
☆88 · Jan 10, 2023 · Updated 3 years ago
sunnychencool / AOQ
Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)
☆34 · Jul 2, 2020 · Updated 6 years ago
BonnieHuangxin / SLTA
ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》
☆36 · Jun 19, 2019 · Updated 7 years ago
jiyanggao / TALL
TALL: Temporal Activity Localization via Language Query
☆220 · Mar 15, 2018 · Updated 8 years ago
Alvin-Zeng / DRN
Dense Regression Network for Video Grounding (CVPR2020)
☆53 · Jan 28, 2021 · Updated 5 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26 · Jun 28, 2021 · Updated 5 years ago
dazhang-cv / MAN
This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"
☆17 · May 27, 2019 · Updated 7 years ago
fartashf / vsepp
PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
☆523 · Dec 8, 2021 · Updated 4 years ago
hardyqr / HAL
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38 · Oct 4, 2023 · Updated 2 years ago
BruceW91 / CVSE
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
☆168 · Feb 7, 2022 · Updated 4 years ago
Sy-Zhang / TCMN-Release
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆16 · Oct 22, 2022 · Updated 3 years ago
li-xirong / avs
Ad-hoc Video Search
☆29 · Feb 18, 2021 · Updated 5 years ago
fanchenyou / HME-VideoQA
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆55 · Sep 13, 2021 · Updated 4 years ago
yiling2018 / saem
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41 · Sep 24, 2019 · Updated 6 years ago
yalesong / pvse
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval (CVPR 2019)
☆135 · Mar 15, 2024 · Updated 2 years ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55 · Jul 31, 2021 · Updated 4 years ago