v-iashin/MDVC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/v-iashin/MDVC)

v-iashin / MDVC

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

☆144

Alternatives and similar repositories for MDVC

Users that are interested in MDVC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

v-iashin / BMT
View on GitHub
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆231Apr 8, 2023Updated 3 years ago
chitwansaharia / HACAModel
View on GitHub
Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…
☆26Nov 3, 2018Updated 7 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
ttengwang / PDVC
View on GitHub
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
☆230Jan 3, 2024Updated 2 years ago
ttengwang / dense-video-captioning-pytorch
View on GitHub
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
☆75Aug 25, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
salesforce / densecap
View on GitHub
☆191Jun 16, 2025Updated last year
JaywongWang / DenseVideoCaptioning
View on GitHub
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…
☆151Jul 8, 2019Updated 7 years ago
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
MDSKUL / MasterProject
View on GitHub
Code voor mijn Master project omtrent VideoBERT
☆39Nov 25, 2020Updated 5 years ago
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
tgc1997 / Awesome-Video-Captioning
View on GitHub
A curated list of research papers in Video Captioning
☆121Jan 5, 2021Updated 5 years ago
iriscxy / VMSMO
View on GitHub
Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''
☆36Jul 30, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
facebookresearch / grounded-video-description
View on GitHub
Video Grounding and Captioning
☆331Oct 12, 2021Updated 4 years ago
smallflyingpig / pytorch_video_caption
View on GitHub
some models for video caption implemented by pytorch. (S2VT)
☆23Feb 1, 2018Updated 8 years ago
v-iashin / video_features
View on GitHub
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…
☆653Feb 1, 2026Updated 5 months ago
yangbang18 / Non-Autoregressive-Video-Captioning
View on GitHub
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57Oct 22, 2023Updated 2 years ago
simon-ging / coot-videotext
View on GitHub
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291Sep 6, 2022Updated 3 years ago
VideoAnalysis / EDUVSUM
View on GitHub
EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…
☆23Mar 8, 2024Updated 2 years ago
ranjaykrishna / densevid_eval
View on GitHub
Evaluation code for Dense-Captioning Events in Videos
☆130Jun 11, 2019Updated 7 years ago
hobincar / SGN
View on GitHub
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Jul 9, 2021Updated 5 years ago
cshizhe / hgr_v2t
View on GitHub
Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".
☆211Jun 12, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jayleicn / TVRetrieval
View on GitHub
[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
☆163May 28, 2024Updated 2 years ago
WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
View on GitHub
Source code for Delving Deeper into the Decoder for Video Captioning
☆39Jun 1, 2021Updated 5 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
VP-0822 / Video-Keyword-Extractor
View on GitHub
A Master Thesis Project on Video Keyword Extractor using Video Summarization techniques.
☆11Oct 25, 2020Updated 5 years ago
syuqings / video-paragraph
View on GitHub
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66Oct 21, 2021Updated 4 years ago
jacobswan1 / Video2Commonsense
View on GitHub
Video captioning baseline models on Video2Commonsense Dataset.
☆56Apr 15, 2021Updated 5 years ago
xiadingZ / video-caption.pytorch
View on GitHub
pytorch implementation of video captioning
☆400Aug 19, 2019Updated 6 years ago
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
linjieli222 / HERO
View on GitHub
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆235Sep 16, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
microsoft / UniVL
View on GitHub
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
☆365Jul 25, 2024Updated last year
hobincar / SA-LSTM
View on GitHub
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Nov 22, 2022Updated 3 years ago
b05902062 / TDConvED
View on GitHub
implementation of TDConvED for video captioning
☆13Mar 18, 2020Updated 6 years ago
SydCaption / SAAT
View on GitHub
☆62May 11, 2021Updated 5 years ago
hobincar / reconstruction-network-for-video-captioning
View on GitHub
☆20Sep 19, 2019Updated 6 years ago
26hzhang / VSLNet
View on GitHub
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆113Oct 15, 2021Updated 4 years ago
vsislab / Controllable_XGating
View on GitHub
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Nov 19, 2019Updated 6 years ago