yiskw713 / VideoCaptioningLinks
video captioning using 3DCNN and LSTM (pytorch)
☆11Updated 5 years ago
Alternatives and similar repositories for VideoCaptioning
Users that are interested in VideoCaptioning are comparing it to the libraries listed below
Sorting:
- Two-Stream Convolutional Networks for Action Recognition in Videos☆47Updated 6 years ago
- ☆33Updated 7 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆42Updated 4 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆26Updated 3 months ago
- I3D feature extractor☆44Updated 5 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Updated 2 years ago
- Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…☆50Updated 4 years ago
- Pytorch C3D feature extractor☆133Updated 7 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆15Updated 6 years ago
- Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)☆47Updated 2 years ago
- PyTorch demo code for "Spatial-Temporal Pyramid Based Convolutional Neural Network for Action Recognition"☆15Updated 6 years ago
- Extension of hLSTMat☆18Updated 4 years ago
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization☆27Updated 3 years ago
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆27Updated 5 years ago
- Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization (CVPR2019)☆152Updated 2 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Updated 5 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Updated 5 years ago
- TCM: Temporal Correlation Module☆17Updated 4 years ago
- Weakly-supervised Action Localization☆49Updated 4 years ago
- Extract video features. Currently, the models includes I3D, will be continuously updated.☆13Updated 5 years ago
- Pytorch 3DNet attention feature map Visualization by [Cam](https://arxiv.org/abs/1512.04150); C3D, R3D, I3D, MF Net is support now!☆66Updated 4 years ago
- Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020)☆36Updated 3 years ago
- STPN - Weakly Supervised Action Localization by Sparse Temporal Pooling Network☆82Updated 6 years ago
- Action-Localization, Atomic Visual Actions (AVA) Dataset☆25Updated 5 years ago
- ☆20Updated 5 years ago
- W-TALC: Weakly-supervised Temporal Activity Localization and Classification☆129Updated 6 years ago
- Zero-shot video classification by end-to-end training of 3D convolutional neural networks☆148Updated 5 years ago
- Two-Stream Convolutional Networks for Action Recognition in Videos☆21Updated 3 years ago
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering☆27Updated 4 years ago
- The implementation of Sequential VLAD in Pytorch☆19Updated 6 years ago