xiadingZ / video-caption.pytorch
pytorch implementation of video captioning
☆400Updated 5 years ago
Related projects: ⓘ
- This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…☆165Updated 4 years ago
- Video Grounding and Captioning☆320Updated 2 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆148Updated 5 years ago
- ☆187Updated 2 years ago
- Automatic image captioning model based on Caffe, using features from bottom-up attention.☆243Updated last year
- Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"☆188Updated 3 years ago
- Code for Unsupervised Image Captioning☆215Updated last year
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆325Updated 3 years ago
- PyTorch implementation of Image captioning with Bottom-up, Top-down Attention☆160Updated 5 years ago
- This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆413Updated last year
- ☆130Updated 5 years ago
- S2VT pytorch implementation☆20Updated 5 years ago
- Evaluation code for Dense-Captioning Events in Videos☆120Updated 5 years ago
- Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning☆107Updated 6 years ago
- A curated list of research papers in Video Captioning☆118Updated 3 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆208Updated 4 years ago
- A collection of recent video understanding datasets, under construction!☆454Updated 6 years ago
- Easy to use video deep features extractor☆305Updated 4 years ago
- Pytorch porting of C3D network, with Sports1M weights☆342Updated 5 years ago
- A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…☆292Updated last month
- Unofficial PyTorch Implementation of SUM-GAN from "Unsupervised Video Summarization with Adversarial LSTM Networks" (CVPR 2017)☆239Updated last year
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆67Updated 4 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆56Updated 3 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆194Updated 4 years ago
- ☆220Updated 2 years ago
- Repository for our CVPR 2017 and IJCV: TGIF-QA☆168Updated 3 years ago
- Faster RCNN model in Pytorch version, pretrained on the Visual Genome with ResNet 101☆231Updated last year
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆47Updated last year
- Pytorch C3D feature extractor☆129Updated 6 years ago
- Strong baseline for visual question answering☆238Updated last year