Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"
☆29Oct 24, 2018Updated 7 years ago
Alternatives and similar repositories for up-down-captioner
Users that are interested in up-down-captioner are comparing it to the libraries listed below.
- VQA driven by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019)☆11Sep 30, 2019Updated 6 years ago
- Pytorch Implementation of Videos as Space-Time Region Graphs☆27May 30, 2025Updated 9 months ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,467Feb 3, 2023Updated 3 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Extension of hLSTMat☆19Apr 15, 2021Updated 4 years ago
- The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…☆16Jun 29, 2017Updated 8 years ago
- Towards Diverse and Natural Image Descriptions via a Conditional GAN☆75Dec 2, 2017Updated 8 years ago
- Study of frame rate effects on MSR-VTT dataset☆14Feb 10, 2018Updated 8 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- The implementation of Sequential VLAD in Pytorch☆20Jun 20, 2019Updated 6 years ago
- [ICCVW 2019] PyTorch code for Class Visualization Pyramid for interpreting spatio-temporal class-specific activations throughout the networ…☆22Mar 9, 2020Updated 6 years ago
- Code for Discriminability objective for training descriptive captions(CVPR 2018)☆109Nov 21, 2019Updated 6 years ago
- Code for learning to generate stylized image captions from unaligned text☆62Aug 13, 2022Updated 3 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆766Mar 10, 2024Updated 2 years ago
- Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"☆79Mar 22, 2018Updated 8 years ago
- Code for the paper "Attention-based LSTM with Semantic Consistency for Videos Captioning"☆18Mar 22, 2017Updated 9 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs☆15Apr 27, 2018Updated 7 years ago
- An image captioning model that is inspired by the Show, Attend and Tell paper (https://arxiv.org/abs/1502.03044) and the Sequence Generat…☆22Sep 4, 2020Updated 5 years ago
- Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"☆66Apr 18, 2019Updated 6 years ago
- Evaluation code for Dense-Captioning Events in Videos☆130Jun 11, 2019Updated 6 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- The code for domain-robust language identification with adversarial loss☆15May 29, 2018Updated 7 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- a sketch-based system for semantic parsing☆10Nov 21, 2022Updated 3 years ago
- Co-attending Regions and Detections for VQA.☆40Jun 2, 2018Updated 7 years ago
- Visual Bidirectional Kernelized Network for Visual Question Answering☆11Jul 17, 2017Updated 8 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"☆11Sep 26, 2021Updated 4 years ago
- PyTorch implementation of paper: "Self-critical Sequence Training for Image Captioning"☆24Apr 8, 2023Updated 2 years ago
- Implementation of video captioning based on OpenNMT☆36Apr 19, 2018Updated 7 years ago
- Image Captioning based on Bottom-Up and Top-Down Attention model☆104Jan 3, 2019Updated 7 years ago
- transform H&E image into CK image using GAN (virtual staining)☆12Nov 24, 2021Updated 4 years ago
- ACM ICMR 2019 "Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention"☆36Jun 19, 2019Updated 6 years ago
- ☆15Jul 23, 2019Updated 6 years ago
- Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)☆135Jul 25, 2024Updated last year