jssprz / attentive_specialized_network_video_captioning

Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*

☆15

Alternatives and similar repositories for attentive_specialized_network_video_captioning

Users who are interested in attentive_specialized_network_video_captioning are comparing it to the repositories listed below.

jssprz / visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
☆31Updated 4 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56Updated 4 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Updated 4 years ago
tgc1997 / Awesome-Video-Captioning
A curated list of research papers in Video Captioning
☆120Updated 4 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Updated 5 years ago
JaywongWang / CBP
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Updated 2 years ago
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Updated 4 years ago
hobincar / RecNet
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆54Updated 5 years ago
iworldtong / TALL.pytorch
PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."
☆14Updated 6 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Updated 5 years ago
jayleicn / TVRetrieval
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆160Updated last year
v-iashin / MDVC
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆143Updated 2 years ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆86Updated 4 years ago
SydCaption / SAAT
☆62Updated 4 years ago
jayleicn / TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆90Updated last year
yj-yu / lsmdc
☆32Updated 6 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Updated 4 years ago
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Updated 6 years ago
syuqings / video-paragraph
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆65Updated 3 years ago
jamespark3922 / adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Updated 6 years ago
maurya-rohit / Scene-Graph-For-Videos
☆15Updated 11 months ago
hobincar / SA-LSTM
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Updated 2 years ago
LisaAnne / TemporalLanguageRelease
☆43Updated 4 years ago
MichiganCOG / Video-Grounding-from-Text
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆46Updated last year
LuoweiZhou / densecap
Dense video captioning in PyTorch
☆41Updated 5 years ago
zfchenUnique / WSSTG
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆55Updated last year
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Updated 5 years ago
yytzsy / ABLR_code
The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"
☆29Updated 6 years ago
LuoweiZhou / anet2016-cuhk-feature
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Updated 6 years ago
TheShadow29 / vognet-pytorch
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆67Updated 5 years ago