Wentong-DST/up-down-captioner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Wentong-DST/up-down-captioner)

Wentong-DST / up-down-captioner

Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"

☆29

Alternatives and similar repositories for up-down-captioner

Users that are interested in up-down-captioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zongshenmu / attention_knowledge_vqa
View on GitHub
vqa drived by bottom-up and top-down attention and knowledge
☆14Nov 21, 2018Updated 7 years ago
hehefan / Video-Captioning
View on GitHub
☆14Jan 30, 2017Updated 9 years ago
fkxssaa / Deliberate-Attention-Networks-for-Image-Captioning
View on GitHub
Deliberate Attention Networks for Image Captioning (AAAI 2019)
☆11Sep 30, 2019Updated 6 years ago
jd730 / STRG
View on GitHub
Pytorch Implementation of Videos as Space-Time Region Graphs
☆27Jul 17, 2026Updated last week
peteanderson80 / bottom-up-attention
View on GitHub
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
☆1,470Feb 3, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sususushi / reconstruction-network-for-video-captioning
View on GitHub
☆16Dec 17, 2018Updated 7 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
View on GitHub
Extension of hLSTMat
☆19Apr 15, 2021Updated 5 years ago
zhaoluffy / hLSTMat
View on GitHub
The paper of "Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning" accepted in International Joint Conference on Arti…
☆16Jun 29, 2017Updated 9 years ago
doubledaibo / gancaption_iccv2017
View on GitHub
Towards Diverse and Natural Image Descriptions via a Conditional GAN
☆75Dec 2, 2017Updated 8 years ago
Curious-Geek / Video-Captioning
View on GitHub
Study of frame rate effects on MSR-VTT dataset
☆14Feb 10, 2018Updated 8 years ago
gurkirt / preprocess-activityNet
View on GitHub
Preprocess the activityNet dataset for detection task
☆13Mar 3, 2017Updated 9 years ago
ARiSE-Lab / CYCLE_OOPSLA_24
View on GitHub
Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"
☆10Mar 8, 2024Updated 2 years ago
MILVLG / mt-captioning
View on GitHub
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Sep 4, 2020Updated 5 years ago
forwchen / HVTG
View on GitHub
Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"
☆17Aug 25, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
youjiangxu / seqvlad-pytorch
View on GitHub
The implementation of Sequential VLAD in Pytorch
☆20Jun 20, 2019Updated 7 years ago
computationalmedia / semstyle
View on GitHub
Code for learning to generate stylized image captions from unaligned text
☆62Aug 13, 2022Updated 3 years ago
ruotianluo / DiscCaptioning
View on GitHub
Code for Discriminability objective for training descriptive captions(CVPR 2018)
☆109Nov 21, 2019Updated 6 years ago
hengyuan-hu / bottom-up-attention-vqa
View on GitHub
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768Mar 10, 2024Updated 2 years ago
DeepRNN / visual_question_answering
View on GitHub
Tensorflow implementation of "Dynamic Memory Networks for Visual and Textual Question Answering"
☆79Mar 22, 2018Updated 8 years ago
Anjaney1999 / image-captioning-seqgan
View on GitHub
An image captioning model that is inspired by the Show, Attend and Tell paper (https://arxiv.org/abs/1502.03044) and the Sequence Generat…
☆22Sep 4, 2020Updated 5 years ago
zhaoluffy / aLSTMs
View on GitHub
Codes for paper of "Attention-based LSTM with Semantic Consistency for Videos Captioning "
☆18Mar 22, 2017Updated 9 years ago
wssun / PromptCS
View on GitHub
A Prompt Learning Framework for Source Code Summarization
☆14Dec 26, 2023Updated 2 years ago
doubledaibo / clcaption_nips2017
View on GitHub
Contrastive Learning for Image Captioning
☆51Feb 22, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ranjaykrishna / densevid_eval
View on GitHub
Evaluation code for Dense-Captioning Events in Videos
☆130Jun 11, 2019Updated 7 years ago
rakshithShetty / captionGAN
View on GitHub
Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"
☆66Apr 18, 2019Updated 7 years ago
vanewu / Structured-Self-Attentive-Sentence-Embedding
View on GitHub
This is an implementation of the paper [A Structured Self-Attentive Sentence Embedding], using Mxnet/Gluon. Finally, the experiment was …
☆13Apr 15, 2019Updated 7 years ago
StanfordVL / STGraph
View on GitHub
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23Mar 4, 2020Updated 6 years ago
lrank / Domain_Robust_Text_Representation
View on GitHub
The code for domain-robust language identification with adversarial loss
☆15May 29, 2018Updated 8 years ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago
Wentong-DST / im2p
View on GitHub
Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs
☆15Apr 27, 2018Updated 8 years ago
P-Song / HYDRA
View on GitHub
HYDRA: Hybrid deep magnetic resonance fingerprinting
☆12Jun 16, 2020Updated 6 years ago
edchengg / VAE_GAN
View on GitHub
VAE+GAN
☆10Apr 18, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ayouboumani / image-captioning-with-attention
View on GitHub
A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'
☆10Jan 20, 2020Updated 6 years ago
ZhouYao0627 / deep-learning-for-image-processing-master
View on GitHub
☆12May 25, 2023Updated 3 years ago
oddguan / Audio-Visual-Video-Caption
View on GitHub
Pytorch implementation of audio-visual fusion video captioning model
☆27Jul 26, 2018Updated 7 years ago
lupantech / dual-mfa-vqa
View on GitHub
Co-attending Regions and Detections for VQA.
☆40Jun 2, 2018Updated 8 years ago
salesforce / BiST
View on GitHub
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11Jun 16, 2025Updated last year
tttyuntian / vlm_lexical_grounding
View on GitHub
PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"
☆11Sep 26, 2021Updated 4 years ago
MarcBS / VIBIKNet
View on GitHub
Visual Bidirectional Kernelized Network for Visual Question Answering
☆11Jul 17, 2017Updated 9 years ago