Peratham/video2text.pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Peratham/video2text.pytorch)

Peratham / video2text.pytorch

PyTorch implementation of video captioning

☆13

Alternatives and similar repositories for video2text.pytorch

Users that are interested in video2text.pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xiadingZ / video-caption-openNMT.pytorch
View on GitHub
implement video caption based on openNMT
☆36Apr 19, 2018Updated 8 years ago
loscheris / VideoCaptioning_att
View on GitHub
A video captioning tool using S2VT method and attention mechanism (TensorFlow)
☆15Oct 14, 2018Updated 7 years ago
bowong / Layered-Memory-Network
View on GitHub
A Layered Memory Network for MovieQA
☆16Apr 27, 2018Updated 8 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
View on GitHub
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Jun 1, 2021Updated 5 years ago
maurya-rohit / Scene-Graph-For-Videos
View on GitHub
☆15Aug 20, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hobincar / reconstruction-network-for-video-captioning
View on GitHub
☆20Sep 19, 2019Updated 6 years ago
hobincar / SA-LSTM
View on GitHub
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Nov 22, 2022Updated 3 years ago
Mahmoud9876 / locationProblem
View on GitHub
The Multi-Capacity and Multi-Level Localization Project tackles the complex problem of finding optimal locations for elements such as fac…
☆13Aug 19, 2025Updated 11 months ago
smallflyingpig / pytorch_video_caption
View on GitHub
some models for video caption implemented by pytorch. (S2VT)
☆23Feb 1, 2018Updated 8 years ago
zhaoluffy / aLSTMs
View on GitHub
Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "
☆18Mar 22, 2017Updated 9 years ago
afzaalis / tubes-makepal-cicd
View on GitHub
☆15Aug 2, 2025Updated 11 months ago
mynlp / cst_captioning
View on GitHub
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
☆60May 15, 2018Updated 8 years ago
Curious-Geek / Video-Captioning
View on GitHub
Study of frame rate effects on MSR-VTT dataset
☆14Feb 10, 2018Updated 8 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hobincar / RecNet
View on GitHub
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆53Apr 6, 2020Updated 6 years ago
ffmpbgrnn / tflibs
View on GitHub
☆25Sep 8, 2017Updated 8 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
Clearloveyuan / awesome-Radiology-Report-Generation
View on GitHub
Paper List about Radiology Report Generation and also some medical image captioning
☆11Oct 5, 2021Updated 4 years ago
OSUPCVLab / VideoToTextDNN
View on GitHub
MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.
☆24Jul 12, 2019Updated 7 years ago
vsislab / Controllable_XGating
View on GitHub
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Nov 19, 2019Updated 6 years ago
Wind-Ward / Image_Caption_Competition
View on GitHub
AI Challenger Image Caption Competition
☆10Dec 13, 2017Updated 8 years ago
entalent / MemCap
View on GitHub
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Mar 17, 2020Updated 6 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
TexasInstruments-Sandbox / edgeai-gst-apps-barcode-reader
View on GitHub
Gstreamer based Edge AI reference application
☆13Feb 26, 2024Updated 2 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
Smartory / Augmentory
View on GitHub
A tool for deep image processing's dataset augmentation
☆17Jul 13, 2026Updated last week
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
warehouse-picking-automation-challenges / nimbro_picking
View on GitHub
☆20Jan 30, 2020Updated 6 years ago
rutaabali3 / BridgeX
View on GitHub
Aptech's E project. BridgeX is a web based platform showcasing the world's most remarkable bridges, their engineering marvels, and histor…
☆16Apr 15, 2026Updated 3 months ago
ZhecanJamesWang / GLAT_SGG
View on GitHub
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11Dec 16, 2020Updated 5 years ago
otoolej / nonlinear-energy-operators
View on GitHub
measures to assess frequency-weighted instantaneous energy
☆18Apr 4, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
robinzhoucmu / PlanarManipulationToolBox
View on GitHub
A simulation, planning and control toolbox for planar manipulation (e.g., pushing and grasping).
☆26Jul 8, 2017Updated 9 years ago
criticalml-uw / LOCATEdit
View on GitHub
Official implementation of "LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Guided Image Editing
☆16May 27, 2025Updated last year
yrcong / NODIS
View on GitHub
Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020
☆12Aug 28, 2020Updated 5 years ago
cxzheng / time-dep-hjb
View on GitHub
time-dependent Hamilton-Jacobi PDEs (http://www.cs.columbia.edu/~cxz/TimeDepHJB/)
☆14Feb 5, 2017Updated 9 years ago
cshizhe / asg2cap
View on GitHub
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200Dec 1, 2022Updated 3 years ago
JaywongWang / SST-Tensorflow
View on GitHub
Tensorflow Implementation of the Paper "SST: Single-Stream Temporal Action Proposals" in CVPR 2017.
☆48Aug 20, 2018Updated 7 years ago
aws-samples / amazon-sagemaker-host-and-inference-whisper-model
View on GitHub
☆18Jan 24, 2024Updated 2 years ago