Pytorch implementation of audio-visual fusion video captioning model
☆27Jul 26, 2018Updated 7 years ago
Alternatives and similar repositories for Audio-Visual-Video-Caption
Users that are interested in Audio-Visual-Video-Caption are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 8 years ago
- pytorch implementation of video captioning☆401Aug 19, 2019Updated 6 years ago
- Memory-augmented Attention Modelling for Videos☆10Apr 24, 2017Updated 9 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Jul 9, 2021Updated 4 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- video captioning using 3DCNN and LSTM (pytorch)☆11Sep 26, 2019Updated 6 years ago
- ☆62May 11, 2021Updated 5 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- ☆33Apr 20, 2018Updated 8 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆151Jul 8, 2019Updated 6 years ago
- A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.☆73Jul 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)☆144Apr 8, 2023Updated 3 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 5 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- ☆10Apr 20, 2018Updated 8 years ago
- Soft attention mechanism for video caption generation☆154Jul 17, 2017Updated 8 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- Code for the paper: Audio-Visual Model Distillation Using Acoustic Images☆21Mar 24, 2023Updated 3 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning☆170Dec 4, 2020Updated 5 years ago
- Experiment with JNI access to some Kaldi functions.☆12Dec 31, 2018Updated 7 years ago
- Dataset and baseline for the first Audiocaption task☆79Jul 25, 2024Updated last year
- A repository for extract CNN features from videos using pytorch☆70Nov 22, 2022Updated 3 years ago
- An image captioning model that is inspired by the Show, Attend and Tell paper (https://arxiv.org/abs/1502.03044) and the Sequence Generat…☆22Sep 4, 2020Updated 5 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Apr 18, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Video captioning baseline models on Video2Commonsense Dataset.☆56Apr 15, 2021Updated 5 years ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- Simple High performance Infrastructure for Neural network Experiments☆14Sep 25, 2023Updated 2 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- Image Caption workout with NIC and NBT☆16Apr 5, 2019Updated 7 years ago
- Video captioning on MSR-VTT Dataset☆12Mar 21, 2021Updated 5 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago