oddguan/Audio-Visual-Video-Caption

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/oddguan/Audio-Visual-Video-Caption)

oddguan / Audio-Visual-Video-Caption

Pytorch implementation of audio-visual fusion video captioning model

☆27

Alternatives and similar repositories for Audio-Visual-Video-Caption

Users that are interested in Audio-Visual-Video-Caption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xiadingZ / video-caption.pytorch
View on GitHub
pytorch implementation of video captioning
☆400Aug 19, 2019Updated 6 years ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago
hobincar / SGN
View on GitHub
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Jul 9, 2021Updated 5 years ago
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
smallflyingpig / pytorch_video_caption
View on GitHub
some models for video caption implemented by pytorch. (S2VT)
☆23Feb 1, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
SydCaption / SAAT
View on GitHub
☆62May 11, 2021Updated 5 years ago
yiskw713 / VideoCaptioning
View on GitHub
video captioning using 3DCNN and LSTM (pytorch)
☆11Sep 26, 2019Updated 6 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
hehefan / Video-Captioning
View on GitHub
☆14Jan 30, 2017Updated 9 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
JaywongWang / DenseVideoCaptioning
View on GitHub
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…
☆151Jul 8, 2019Updated 7 years ago
nasib-ullah / video-captioning-models-in-Pytorch
View on GitHub
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73Jul 30, 2023Updated 2 years ago
luo3300612 / Semantics-AssistedVideoCaptioning.pytorch
View on GitHub
pytorch implementation of Semantics-AssistedVideoCaptioning
☆11Feb 16, 2023Updated 3 years ago
xiadingZ / video-caption-openNMT.pytorch
View on GitHub
implement video caption based on openNMT
☆36Apr 19, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sususushi / reconstruction-network-for-video-captioning
View on GitHub
☆16Dec 17, 2018Updated 7 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
loscheris / VideoCaptioning_att
View on GitHub
A video captioning tool using S2VT method and attention mechanism (TensorFlow)
☆15Oct 14, 2018Updated 7 years ago
zhaoluffy / hLSTMat
View on GitHub
The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…
☆16Jun 29, 2017Updated 9 years ago
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
tsenghungchen / SA-tensorflow
View on GitHub
Soft attention mechanism for video caption generation
☆154Jul 17, 2017Updated 9 years ago
pochih / Video-Cap
View on GitHub
🎬 Video Captioning: ICCV '15 paper implementation
☆47May 30, 2018Updated 8 years ago
Curious-Geek / Video-Captioning
View on GitHub
Study of frame rate effects on MSR-VTT dataset
☆14Feb 10, 2018Updated 8 years ago
hobincar / RecNet
View on GitHub
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆53Apr 6, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
afperezm / acoustic-images-distillation
View on GitHub
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
zhaoluffy / aLSTMs
View on GitHub
Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "
☆18Mar 22, 2017Updated 9 years ago
jd730 / STRG
View on GitHub
Pytorch Implementation of Videos as Space-Time Region Graphs
☆27Updated this week
hahaha108 / JDSpider
View on GitHub
京东爬虫，可以实现输入一个关键字后自动爬取相关的商品信息，也可以用于自定义爬取商品的评论。
☆11Mar 23, 2018Updated 8 years ago
Anjaney1999 / image-captioning-seqgan
View on GitHub
An image captioning model that is inspired by the Show, Attend and Tell paper (https://arxiv.org/abs/1502.03044) and the Sequence Generat…
☆22Sep 4, 2020Updated 5 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
ahmetaa / kaldi-jni
View on GitHub
Experiment with JNI access to some Kaldi functions.
☆12Dec 31, 2018Updated 7 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
tuyunbin / Video-Description-with-Spatial-Temporal-Attention
View on GitHub
[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
☆61Oct 20, 2020Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lyulyul / shine-cluster
View on GitHub
Simple High performance Infrastructure for Neural network Experiments
☆14Sep 25, 2023Updated 2 years ago
ezeli / Transformer_model
View on GitHub
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12Nov 15, 2021Updated 4 years ago
rakshithShetty / captionGAN
View on GitHub
Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"
☆66Apr 18, 2019Updated 7 years ago
Wentong-DST / up-down-captioner
View on GitHub
Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
☆29Oct 24, 2018Updated 7 years ago
nviable / deepfake-blips
View on GitHub
Multimodal late fusion for deepfake detection using video and audio data
☆12May 7, 2019Updated 7 years ago
yugaljain1999 / Video_Captioning_Pytorch
View on GitHub
Video captioning on MSR-VTT Dataset
☆12Mar 21, 2021Updated 5 years ago
Franceshe / awesome-generative-models
View on GitHub
A collection of awesome generative model papers, frameworks, libraries, software and resources for text, image, video, animation, code ge…
☆25May 27, 2021Updated 5 years ago