video captioning using 3DCNN and LSTM (pytorch)
☆11Sep 26, 2019Updated 6 years ago
Alternatives and similar repositories for VideoCaptioning
Users that are interested in VideoCaptioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Apr 20, 2018Updated 7 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Jul 9, 2021Updated 4 years ago
- A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.☆74Jul 30, 2023Updated 2 years ago
- ☆11Jul 11, 2023Updated 2 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Nov 22, 2022Updated 3 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆15Jul 2, 2019Updated 6 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 4 years ago
- An open source project on my CSDN blog, whose dataset is the CNN/DM and whose model is T5.☆12Jul 9, 2023Updated 2 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- ☆35Mar 22, 2019Updated 7 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆15Jan 2, 2023Updated 3 years ago
- a repository for remote sensing captions with attention , including Sydney and UCM☆11May 27, 2019Updated 6 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.☆12Nov 15, 2021Updated 4 years ago
- the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization☆24Mar 10, 2019Updated 7 years ago
- ☆16Jun 2, 2025Updated 9 months ago
- Video captioning on MSR-VTT Dataset☆12Mar 21, 2021Updated 5 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆14Mar 29, 2023Updated 2 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- An Android app that provides ranging and indoor positioning for UWB-capable Android devices☆16Dec 2, 2024Updated last year
- Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "☆18Mar 22, 2017Updated 9 years ago
- The social-LSTM code for complete trajectory prediction (20 frames). In this repository, the normalized trajectory and non-normalized tra…☆12Apr 16, 2023Updated 2 years ago
- Tool convert Visdrone to COCO☆13Aug 9, 2021Updated 4 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- DRLSE level set segmentation☆11Oct 24, 2017Updated 8 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- Contains tools for generalized Procrustes analysis, active shape models and shape-based image warping☆10Jun 28, 2014Updated 11 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.☆11Apr 14, 2021Updated 4 years ago
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".☆18May 10, 2023Updated 2 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- (Pattern Recognition 2025) Towards Trustworthy Dataset Distillation☆14Dec 8, 2024Updated last year
- ☆14Apr 25, 2025Updated 10 months ago
- [ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"☆61Oct 20, 2020Updated 5 years ago
- A python implementation of the active shape models by Cootes (http://www2.compute.dtu.dk/courses/02511/docs/asm_overview.pdf)☆10Jul 17, 2018Updated 7 years ago
- ☆13Feb 28, 2025Updated last year
- Pytorch를 활용한 WandB의 Sweeps 🧹☆15Dec 24, 2022Updated 3 years ago