daicoolb/Awesome-Video-Captioning

video captioning

☆24

Alternatives and similar repositories for Awesome-Video-Captioning

Users who are interested in Awesome-Video-Captioning are comparing it to the libraries listed below.

StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 • Mar 4, 2020 • Updated 6 years ago
hobincar / SA-LSTM
A PyTorch implementation of "Describing Videos by Exploiting Temporal Structure", ICCV 2015
☆48 • Nov 22, 2022 • Updated 3 years ago
tgc1997 / Awesome-Video-Captioning
A curated list of research papers in Video Captioning
☆121 • Jan 5, 2021 • Updated 5 years ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46 • Jul 29, 2020 • Updated 5 years ago
hobincar / SGN
Official PyTorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 • Jul 9, 2021 • Updated 5 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56 • Apr 15, 2021 • Updated 5 years ago
forence / Awesome-Visual-Captioning
This repository focuses on Image Captioning, Video Captioning, Seq-to-Seq Learning, and NLP
☆410 • Nov 14, 2022 • Updated 3 years ago
zhegan27 / SCN_for_video_captioning
Using Semantic Compositional Networks for Video Captioning
☆96 • Nov 27, 2018 • Updated 7 years ago
SydCaption / SAAT
☆62 • May 11, 2021 • Updated 5 years ago
nouman-10 / Image-Captioning
☆12 • May 8, 2019 • Updated 7 years ago
jssprz / attentive_specialized_network_video_captioning
Source code of the paper titled "Attentive Visual Semantic Specialized Network for Video Captioning"
☆15 • Apr 6, 2021 • Updated 5 years ago
LibertFan / ImageCaption
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning (ACL 2019)
☆17 • Sep 8, 2019 • Updated 6 years ago
hobincar / RecNet
A PyTorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆53 • Apr 6, 2020 • Updated 6 years ago
eric-xw / Video-guided-Machine-Translation
Starter code for the VMT task and challenge
☆51 • Jul 29, 2020 • Updated 5 years ago
AmingWu / CCN
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15 • May 6, 2021 • Updated 5 years ago
jayleicn / TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91 • Sep 6, 2023 • Updated 2 years ago
Peratham / video2text.pytorch
PyTorch implementation of video captioning
☆13 • Sep 24, 2017 • Updated 8 years ago
yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI 2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57 • Oct 22, 2023 • Updated 2 years ago
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50 • Dec 18, 2019 • Updated 6 years ago
ahmedssabir / Textual-Visual-Semantic-Dataset-for-Text-Spotting
Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020
☆12 • Jul 2, 2022 • Updated 4 years ago
tuyunbin / Video-Description-with-Spatial-Temporal-Attention
[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
☆61 • Oct 20, 2020 • Updated 5 years ago
luo3300612 / Semantics-AssistedVideoCaptioning.pytorch
PyTorch implementation of Semantics-AssistedVideoCaptioning
☆11 • Feb 16, 2023 • Updated 3 years ago
jayleicn / VideoLanguageFuturePred
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52 • Aug 20, 2022 • Updated 3 years ago
OSUPCVLab / VideoToTextDNN
MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.
☆24 • Jul 12, 2019 • Updated 7 years ago
tgc1997 / RMN
IJCAI 2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 • Nov 23, 2020 • Updated 5 years ago
maurya-rohit / Scene-Graph-For-Videos
☆15 • Aug 20, 2024 • Updated last year
b05902062 / TDConvED
Implementation of TDConvED for video captioning
☆13 • Mar 18, 2020 • Updated 6 years ago
wangpengnorman / KB-Ref_dataset
☆16 • Dec 28, 2020 • Updated 5 years ago
jimmy646 / violin
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161 • Apr 29, 2020 • Updated 6 years ago
Deanplayerljx / tab-vcr
PyTorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19 • May 6, 2021 • Updated 5 years ago
pcascanteb / VAE-ImgCaptioning
Implementation for the project: Variational Image Captioning Using Deterministic Attention
☆13 • Dec 14, 2018 • Updated 7 years ago
fkxssaa / Deliberate-Attention-Networks-for-Image-Captioning
Deliberate Attention Networks for Image Captioning (AAAI 2019)
☆11 • Sep 30, 2019 • Updated 6 years ago
volkancirik / groundnet
Repository for AAAI 2018 paper "Using Syntax for Referring Expression Recognition"
☆13 • Oct 7, 2020 • Updated 5 years ago
AnnikaLindh / Diverse_and_Specific_Image_Captioning
Unsupervised specificity-guided optimization of Image Captioning models to encourage meaningful diversity in the generated captions. Code…
☆13 • May 25, 2025 • Updated last year
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
☆39 • Jun 1, 2021 • Updated 5 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 • Dec 4, 2020 • Updated 5 years ago
airsplay / VisualRelationships
Data of ACL 2019 Paper "Expressing Visual Relationships via Language".
☆63 • Sep 30, 2020 • Updated 5 years ago
yj-yu / lsmdc
☆33 • Nov 12, 2018 • Updated 7 years ago
ruotianluo / GoogleConceptualCaptioning
☆53 • Dec 13, 2019 • Updated 6 years ago