chitwansaharia / HACAModel

Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)
26Updated 6 years ago

Related projects

Alternatives and complementary repositories for HACAModel