zohrehghaderi / VASTA

A Video-to-Text Framework

☆10

Alternatives and similar repositories for VASTA

Users who are interested in VASTA are comparing it to the libraries listed below.

MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆52 Updated 2 years ago
26hzhang / ReLoCLNet
Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)
☆58 Updated 4 years ago
sangminwoo / Explore-And-Match
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …
☆42 Updated 3 years ago
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33 Updated last year
HuiGuanLab / DL-DKD
ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval
☆17 Updated last month
Soldelli / VLG-Net
VLG-Net: Video-Language Graph Matching Networks for Video Grounding
☆31 Updated 3 years ago
liupeng0606 / clip4caption
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆15 Updated 2 years ago
r-cui / ViGA
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022
☆69 Updated 3 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43 Updated 5 years ago
tanghaoyu258 / ACRM-for-moment-retrieval
☆27 Updated 3 years ago
HuiGuanLab / ms-sl
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆54 Updated 10 months ago
showlab / mist
☆36 Updated last year
minghangz / cnm
Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining
☆29 Updated 3 years ago
Huntersxsx / TSGV-Learning-List
Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval
☆29 Updated 3 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 Updated 4 years ago
LiJiaBei-7 / rivrl
Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval
☆19 Updated 3 years ago
YYJMJC / Compositional-Temporal-Grounding
☆31 Updated 3 years ago
haojc / ShufflingVideosForTSG
Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"
☆29 Updated 2 years ago
yangbang18 / CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
☆30 Updated 8 months ago
ylqi / GL-RG
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆19 Updated 2 years ago
SCZwangxiao / Temporal-Language-Grounding-in-videos
Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
☆98 Updated 3 years ago
bofang98 / UATVR
[ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval
☆13 Updated last year
nasib-ullah / video-captioning-models-in-Pytorch
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73 Updated 2 years ago
26hzhang / VSLNet
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆109 Updated 3 years ago
crodriguezo / DORi
Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan…
☆21 Updated 4 years ago
liudaizong / CSMGAN
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34 Updated 5 years ago
houzhijian / CONQUER
[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval
☆42 Updated 3 years ago
yytzsy / grounding_changing_distribution
☆36 Updated 4 years ago
Alvin-Zeng / DRN
Dense Regression Network for Video Grounding (CVPR2020)
☆53 Updated 4 years ago
RyanLiut / awesome-diverse-captioning
Some papers about *diverse* image (a few videos) captioning
☆26 Updated 2 years ago