Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
☆29Oct 24, 2018Updated 7 years ago
Alternatives and similar repositories for up-down-captioner
Users that are interested in up-down-captioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- vqa drived by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆27May 30, 2025Updated 10 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,466Feb 3, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Dec 17, 2018Updated 7 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 5 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Towards Diverse and Natural Image Descriptions via a Conditional GAN☆75Dec 2, 2017Updated 8 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- Preprocess the activityNet dataset for detection task☆13Mar 3, 2017Updated 9 years ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The implementation of Sequential VLAD in Pytorch☆20Jun 20, 2019Updated 6 years ago
- [ICCVW 2019] PyTorch code for Class Visualization Pyramid for intpreting spatio-temporal class-specific activations throughout the networ…☆22Mar 9, 2020Updated 6 years ago
- Code for Discriminability objective for training descriptive captions(CVPR 2018)☆109Nov 21, 2019Updated 6 years ago
- Code for learning to generate stylized image captions from unaligned text☆62Aug 13, 2022Updated 3 years ago
- Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"☆79Mar 22, 2018Updated 8 years ago
- Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "☆18Mar 22, 2017Updated 9 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs☆15Apr 27, 2018Updated 7 years ago
- An image captioning model that is inspired by the Show, Attend and Tell paper (https://arxiv.org/abs/1502.03044) and the Sequence Generat…☆22Sep 4, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Apr 18, 2019Updated 6 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- The code for domain-robust language identification with adversarial loss☆15May 29, 2018Updated 7 years ago
- Memory-augmented Attention Modelling for Videos☆10Apr 24, 2017Updated 8 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- VAE+GAN☆10Apr 18, 2018Updated 7 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- Co-attending Regions and Detections for VQA.☆40Jun 2, 2018Updated 7 years ago
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆24Apr 8, 2023Updated 3 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"☆11Sep 26, 2021Updated 4 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Jan 3, 2019Updated 7 years ago
- ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》☆36Jun 19, 2019Updated 6 years ago