facebookresearch/grounded-video-description

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/grounded-video-description)

facebookresearch / grounded-video-description

Video Grounding and Captioning

☆331

Alternatives and similar repositories for grounded-video-description

Users that are interested in grounded-video-description are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / ActivityNet-Entities
View on GitHub
A Dataset for Grounded Video Description
☆165Jan 4, 2022Updated 4 years ago
salesforce / densecap
View on GitHub
☆191Jun 16, 2025Updated last year
JaywongWang / DenseVideoCaptioning
View on GitHub
Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…
☆151Jul 8, 2019Updated 7 years ago
xiadingZ / video-caption.pytorch
View on GitHub
pytorch implementation of video captioning
☆400Aug 19, 2019Updated 6 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
forence / Awesome-Visual-Captioning
View on GitHub
This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410Nov 14, 2022Updated 3 years ago
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
LuoweiZhou / anet2016-cuhk-feature
View on GitHub
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Mar 31, 2019Updated 7 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
ranjaykrishna / densevid_eval
View on GitHub
Evaluation code for Dense-Captioning Events in Videos
☆130Jun 11, 2019Updated 7 years ago
vijayvee / video-captioning
View on GitHub
This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as i…
☆170Oct 12, 2019Updated 6 years ago
vsislab / Controllable_XGating
View on GitHub
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Nov 19, 2019Updated 6 years ago
TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
LuoweiZhou / detectron-vlp
View on GitHub
Detectron for image/video region feature extraction, inspired by Xinlei's repo
☆22Nov 21, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
zfchenUnique / WSSTG
View on GitHub
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆57Jul 8, 2024Updated 2 years ago
Abdelrhman-Yasser / video-content-description
View on GitHub
Video content description model for generating descriptions for unconstrained videos
☆15Jul 5, 2019Updated 7 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
jayleicn / TVQAplus
View on GitHub
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132Oct 25, 2022Updated 3 years ago
yytzsy / SCDM
View on GitHub
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Sep 7, 2021Updated 4 years ago
zhegan27 / SCN_for_video_captioning
View on GitHub
Using Semantic Compositional Networks for Video Captioning
☆96Nov 27, 2018Updated 7 years ago
hobincar / RecNet
View on GitHub
A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018
☆53Apr 6, 2020Updated 6 years ago
v-iashin / MDVC
View on GitHub
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆144Apr 8, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
ttengwang / dense-video-captioning-pytorch
View on GitHub
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
☆75Aug 25, 2021Updated 4 years ago
MichiganCOG / Video-Grounding-from-Text
View on GitHub
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆47Jun 22, 2024Updated 2 years ago
zfchenUnique / VID-Sentence
View on GitHub
This repository provides the dataset introduced by our WSSTG paper
☆13Jul 21, 2019Updated 7 years ago
v-iashin / BMT
View on GitHub
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆231Apr 8, 2023Updated 3 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
SydCaption / SAAT
View on GitHub
☆62May 11, 2021Updated 5 years ago
tgc1997 / Awesome-Video-Captioning
View on GitHub
A curated list of research papers in Video Captioning
☆121Jan 5, 2021Updated 5 years ago
ttengwang / ESGN
View on GitHub
Event Sequence Generation Network
☆14Jun 22, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jamespark3922 / adv-inf
View on GitHub
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Jul 17, 2019Updated 7 years ago
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
microsoft / SwinBERT
View on GitHub
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆251May 26, 2022Updated 4 years ago
LuoweiZhou / VLP
View on GitHub
Vision-Language Pre-training for Image Captioning and Question Answering
☆420Jan 18, 2022Updated 4 years ago
niluthpol / multimodal_vtt
View on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68Apr 10, 2020Updated 6 years ago
simon-ging / coot-videotext
View on GitHub
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
☆291Sep 6, 2022Updated 3 years ago
fawazsammani / show-edit-tell
View on GitHub
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆82Jul 17, 2020Updated 6 years ago