luo3300612/Semantics-AssistedVideoCaptioning.pytorch

PyTorch implementation of Semantics-AssistedVideoCaptioning

☆11

Alternatives and similar repositories for Semantics-AssistedVideoCaptioning.pytorch

Users that are interested in Semantics-AssistedVideoCaptioning.pytorch are comparing it to the libraries listed below.

WingsBrokenAngel / delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
☆39 · Jun 1, 2021 · Updated 5 years ago
SydCaption / SAAT
☆62 · May 11, 2021 · Updated 5 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
Extension of hLSTMat
☆19 · Apr 15, 2021 · Updated 5 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 · Dec 4, 2020 · Updated 5 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 · Jul 9, 2021 · Updated 5 years ago
luogen1996 / LWTransformer
Lightweight Transformer for Multi-modal Tasks
☆16 · Dec 9, 2022 · Updated 3 years ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55 · Jul 31, 2021 · Updated 4 years ago
cshizhe / asg2cap
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200 · Dec 1, 2022 · Updated 3 years ago
TianYafu / road-status-graph-dataset
☆21 · Dec 10, 2020 · Updated 5 years ago
tgc1997 / Awesome-Video-Captioning
A curated list of research papers in Video Captioning
☆121 · Jan 5, 2021 · Updated 5 years ago
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 · Nov 23, 2020 · Updated 5 years ago
LeeYN-43 / Clover
Official PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆40 · Feb 15, 2023 · Updated 3 years ago
YouHuang67 / mamba-code-explained
☆19 · Jan 7, 2026 · Updated 6 months ago
jssprz / attentive_specialized_network_video_captioning
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
☆15 · Apr 6, 2021 · Updated 5 years ago
jssprz / video_features_extractor
Python implementation of extraction of several visual features representations from videos
☆23 · Jul 19, 2021 · Updated 5 years ago
Curious-Geek / Video-Captioning
Study of frame rate effects on MSR-VTT dataset
☆14 · Feb 10, 2018 · Updated 8 years ago
Kien085 / SG2Caps
☆23 · Aug 21, 2021 · Updated 4 years ago
jssprz / visual_syntactic_embedding_video_captioning
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
☆30 · Apr 16, 2021 · Updated 5 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 · Mar 4, 2020 · Updated 6 years ago
tangyuhao2016 / CTRG
☆19 · Aug 21, 2023 · Updated 2 years ago
stephen-pilli / DeepSentiBank
☆17 · Aug 6, 2021 · Updated 4 years ago
hobincar / SA-LSTM
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48 · Nov 22, 2022 · Updated 3 years ago
luo3300612 / Transformer-Captioning
Optimized code based on M2 for faster image captioning training
☆21 · Nov 18, 2022 · Updated 3 years ago
ruotianluo / GoogleConceptualCaptioning
☆53 · Dec 13, 2019 · Updated 6 years ago
daicoolb / Awesome-Video-Captioning
video captioning
☆24 · Mar 14, 2019 · Updated 7 years ago
ruotianluo / Transformer_Captioning
Use transformer for captioning
☆156 · May 2, 2019 · Updated 7 years ago
xmu-xiaoma666 / X-Mesh
A pytorch implementation of “X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance”
☆29 · Jan 12, 2024 · Updated 2 years ago
hello-robot / stretch_urdf
URDFs for the Stretch mobile manipulators from Hello Robot Inc.
☆15 · Jun 5, 2026 · Updated last month
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56 · Apr 15, 2021 · Updated 5 years ago
forence / Awesome-Visual-Captioning
This repository focuses on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410 · Nov 14, 2022 · Updated 3 years ago
yangbang18 / CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
☆32 · Dec 26, 2024 · Updated last year
jayleicn / TVCaption
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91 · Sep 6, 2023 · Updated 2 years ago
yytzsy / SMCG
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12 · Apr 14, 2021 · Updated 5 years ago
WeihuangLin / INF-LLaVA
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42 · Aug 4, 2024 · Updated last year
VideoAnalysis / EDUVSUM
EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…
☆23 · Mar 8, 2024 · Updated 2 years ago
YouHuang67 / High-Resolution-Segment-Anything
☆34 · Jul 4, 2024 · Updated 2 years ago
zhegan27 / Semantic_Compositional_Nets
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
☆68 · Mar 26, 2018 · Updated 8 years ago
eric-xw / kinetics-i3d-pytorch
☆35 · Mar 22, 2019 · Updated 7 years ago
jwyang / DeepLearning-500-questions
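500 Questions on Deep Learning: a question-and-answer treatment of frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and more, intended to help the author and any readers who need it. The book is divided into 18 chapters and runs to nearly 300,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact sc…
☆26 · Dec 9, 2018 · Updated 7 years ago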