chitwansaharia/HACAModel

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chitwansaharia/HACAModel)

chitwansaharia / HACAModel

Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)

☆26

Alternatives and similar repositories for HACAModel

Users that are interested in HACAModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

b05902062 / TDConvED
View on GitHub
implementation of TDConvED for video captioning
☆13Mar 18, 2020Updated 6 years ago
jamespark3922 / adv-inf
View on GitHub
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
☆34Jul 17, 2019Updated 7 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
smallflyingpig / pytorch_video_caption
View on GitHub
some models for video caption implemented by pytorch. (S2VT)
☆23Feb 1, 2018Updated 8 years ago
hobincar / reconstruction-network-for-video-captioning
View on GitHub
☆20Sep 19, 2019Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
v-iashin / MDVC
View on GitHub
PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)
☆144Apr 8, 2023Updated 3 years ago
vsislab / Controllable_XGating
View on GitHub
ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68Nov 19, 2019Updated 6 years ago
DataScienceNigeria / AI-powered-by-Google-s-VideoBERT-
View on GitHub
☆10Sep 26, 2019Updated 6 years ago
sususushi / reconstruction-network-for-video-captioning
View on GitHub
☆16Dec 17, 2018Updated 7 years ago
VisionLearningGroup / JEDDi-Net
View on GitHub
Implementation for "Joint Event Detection and Description in Continuous Video Streams"
☆23Nov 4, 2020Updated 5 years ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
DuJiajun1994 / ImageCaptionGAN
View on GitHub
☆10May 10, 2019Updated 7 years ago
zhaoluffy / hLSTMat
View on GitHub
The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…
☆16Jun 29, 2017Updated 9 years ago
Sundrops / video-caption.pytorch
View on GitHub
☆33Apr 20, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ramakanth-pasunuru / video_captioning_rl
View on GitHub
Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"
☆43Nov 19, 2019Updated 6 years ago
adwardlee / multitask-end-to-end-video-captioning
View on GitHub
with reinforcement learning
☆32May 19, 2020Updated 6 years ago
WingsBrokenAngel / Semantics-AssistedVideoCaptioning
View on GitHub
Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy
☆55Jul 31, 2021Updated 4 years ago
tgc1997 / RMN
View on GitHub
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79Nov 23, 2020Updated 5 years ago
hobincar / SA-LSTM
View on GitHub
A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015
☆48Nov 22, 2022Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
NLP2CT / ua-cl-nmt
View on GitHub
Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)
☆11Jun 12, 2020Updated 6 years ago
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
salesforce / densecap
View on GitHub
☆191Jun 16, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
niluthpol / multimodal_vtt
View on GitHub
Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval
☆68Apr 10, 2020Updated 6 years ago
ttengwang / ESGN
View on GitHub
Event Sequence Generation Network
☆14Jun 22, 2021Updated 5 years ago
hardyqr / HAL
View on GitHub
[AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".
☆38Oct 4, 2023Updated 2 years ago
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
AmingWu / Multi-modal-Circulant-Fusion
View on GitHub
the source code of Multi-modal Circulant Fusion (MCF) for Temporal Activity Localization
☆24Mar 10, 2019Updated 7 years ago
loscheris / VideoCaptioning_att
View on GitHub
A video captioning tool using S2VT method and attention mechanism (TensorFlow)
☆15Oct 14, 2018Updated 7 years ago
cswhjiang / Recurrent_Fusion_Network
View on GitHub
Source code for "Recurrent Fusion Network for Image Captioning".
☆23Nov 24, 2018Updated 7 years ago
nyu-dl / dl4mt-seqgen
View on GitHub
☆31Jun 13, 2019Updated 7 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
mynlp / cst_captioning
View on GitHub
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
☆60May 15, 2018Updated 8 years ago
heejin928 / How-Positive-Are-You-Text-Style-Transfer-using-Adaptive-Style-Embedding
View on GitHub
☆14Oct 30, 2021Updated 4 years ago
MarcBS / TMA
View on GitHub
Egocentric Video Description based on Temporally-Linked Sequences
☆11Jul 17, 2017Updated 9 years ago
tgc1997 / Awesome-Video-Captioning
View on GitHub
A curated list of research papers in Video Captioning
☆121Jan 5, 2021Updated 5 years ago
fkxssaa / Deliberate-Attention-Networks-for-Image-Captioning
View on GitHub
Deliberate Attention Networks for Image Captioning (AAAI 2019)
☆11Sep 30, 2019Updated 6 years ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago