captanlevi / Meaning-guided-video-captioning-Links

Here we describe a new approach to train a video captioning neural network , that is not only based on the normal cross entropy loss for the caption but also uses the meaning of the caption.

☆7

Alternatives and similar repositories for Meaning-guided-video-captioning-

Users that are interested in Meaning-guided-video-captioning- are comparing it to the libraries listed below

Sorting:

jamespark3922 / adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Updated 6 years ago
daicoolb / Awesome-Video-Captioning
video captioning
☆24Updated 6 years ago
chitwansaharia / HACAModel
Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…
☆26Updated 6 years ago
JaywongWang / CBP
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆60Updated 2 years ago
LuoweiZhou / densecap
Dense video captioning in PyTorch
☆41Updated 5 years ago
ramakanth-pasunuru / video_captioning_rl
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆44Updated 5 years ago
MichiganCOG / Video-Grounding-from-Text
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆46Updated last year
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Updated 5 years ago
yj-yu / lsmdc
☆32Updated 6 years ago
VisionLearningGroup / Text-to-Clip_Retrieval
Implementation for "Multilevel Language and Vision Integration for Text-to-Clip Retrieval"
☆50Updated 6 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34Updated 5 years ago
ranjaykrishna / densevid_eval
Evaluation code for Dense-Captioning Events in Videos
☆128Updated 6 years ago
LuoweiZhou / anet2016-cuhk-feature
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Updated 6 years ago
niluthpol / multimodal_vtt
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆67Updated 5 years ago
VisionLearningGroup / JEDDi-Net
Implementation for "Joint Event Detection and Description in Continuous Video Streams"
☆23Updated 4 years ago
LuoweiZhou / ProcNets-YouCook2
Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"
☆34Updated 6 years ago
XiangChenchao / DDPN
Rethinking Diversified and Discriminative Proposal Generation for Visual Grounding
☆23Updated 7 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56Updated 4 years ago
XgDuan / WSDEC
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Updated 5 years ago
jayleicn / TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆90Updated last year
WuJie1010 / Temporally-language-grounding
A Pytorch implemention for some state-of-the-art models for" Temporally Language Grounding in Untrimmed Videos"
☆96Updated 5 years ago
fanchenyou / HME-VideoQA
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆54Updated 3 years ago
iworldtong / TALL.pytorch
PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."
☆14Updated 6 years ago
JaywongWang / DenseVideoCaptioning
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…
☆150Updated 6 years ago
hobincar / SA-LSTM
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Updated 2 years ago
KaihuaTang / VCTree-Visual-Question-Answering
Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…
☆34Updated 6 years ago
oddguan / Audio-Visual-Video-Caption
Pytorch implementation of audio-visual fusion video captioning model
☆27Updated 6 years ago
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Updated 5 years ago
yytzsy / SCDM
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Updated 3 years ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆44Updated 4 years ago