yiskw713 / VideoCaptioning
video captioning using 3DCNN and LSTM (pytorch)
☆10Updated 4 years ago
Related projects: ⓘ
- ☆32Updated 6 years ago
- I3D feature extractor☆43Updated 4 years ago
- Extract video feature from C3D pretrained on Sports-1M and Kinetics☆14Updated 5 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆22Updated 4 years ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization☆40Updated 3 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆26Updated last month
- Pytorch implementation of audio-visual fusion video captioning model☆25Updated 6 years ago
- Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…☆49Updated 3 years ago
- Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020)☆46Updated last year
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning☆29Updated 4 years ago
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆47Updated last year
- ☆16Updated this week
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering☆24Updated 3 years ago
- ☆15Updated last month
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17Updated 5 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆52Updated 3 years ago
- ☆20Updated 5 years ago
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"☆30Updated 5 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Updated 4 years ago
- Extract video features. Currently, the models includes I3D, will be continuously updated.☆12Updated 4 years ago
- ☆16Updated 5 years ago
- PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."☆13Updated 5 years ago
- a way to download the dataset of ActivityNet☆23Updated 6 years ago
- video captioning☆24Updated 5 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Updated 5 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆42Updated 4 years ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization☆33Updated 4 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Updated 4 years ago
- Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video (AAAI2020)☆45Updated 4 years ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆16Updated 5 years ago