hobincar / SGNLinks

Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"

☆54

Alternatives and similar repositories for SGN

Users that are interested in SGN are comparing it to the libraries listed below

Sorting:

yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆58Updated last year
SydCaption / SAAT
☆62Updated 4 years ago
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Updated 4 years ago
jssprz / visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
☆31Updated 4 years ago
hobincar / RecNet
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆54Updated 5 years ago
ttengwang / dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
☆75Updated 3 years ago
vsislab / Controllable_XGating
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Updated 5 years ago
tgc1997 / Awesome-Video-Captioning
A curated list of research papers in Video Captioning
☆120Updated 4 years ago
devtrace404 / Video-Captioning-Using-Object-Trajectory-Features
Video Captioning on MSR-VTT and MSVD dataset using Deep Learning
☆21Updated 4 years ago
liudaizong / CSMGAN
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34Updated 4 years ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55Updated 4 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Updated 5 years ago
hobincar / SA-LSTM
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Updated 2 years ago
ikuinen / CMIN_moment_retrieval
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆86Updated 4 years ago
syuqings / video-paragraph
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆65Updated 3 years ago
YiwuZhong / Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆97Updated 11 months ago
MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆52Updated 2 years ago
niluthpol / weak_supervised_video_moment
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Updated 5 years ago
nasib-ullah / video-captioning-models-in-Pytorch
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73Updated 2 years ago
thaolmk54 / hcrn-videoqa
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆133Updated last year
cshizhe / hgr_v2t
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Updated 5 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56Updated 4 years ago
Sha-Lab / CMHSE
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Updated 6 years ago
MILVLG / mt-captioning
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Updated 4 years ago
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33Updated last year
violetteshev / bottom-up-features
Bottom-up features extractor implemented in PyTorch.
☆72Updated 5 years ago
tanghaoyu258 / ACRM-for-moment-retrieval
☆27Updated 2 years ago
yiling2018 / saem
Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
☆41Updated 5 years ago
aioz-ai / ICCV19_VQA-CTI
Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)
☆38Updated 2 years ago
JonghwanMun / LGI4temporalgrounding
Repository for the CVPR-20 paper "Local-Global Video-Text Interactions for Temporal Grounding"
☆131Updated 4 years ago