zyj0021200 / simpleImageCaptionZooLinks

Simple but Comprehensive PyTorch Implementation of Image Captioning Models.

☆13

Alternatives and similar repositories for simpleImageCaptionZoo

Users that are interested in simpleImageCaptionZoo are comparing it to the libraries listed below

Sorting:

CCYChongyanChen / VQA_AlgorithmDatasets
☆38Updated 2 years ago
ezeli / BUTD_model
A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.
☆47Updated 3 years ago
BierOne / bottom-up-attention-vqa
An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…
☆36Updated 3 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆46Updated 4 years ago
entalent / MemCap
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Updated 5 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54Updated 3 years ago
HLR / Cross_Modality_Relevance
The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"
☆27Updated 4 years ago
ezeli / bottom_up_features_extract
An PyTorch reimplementation of bottom-up-attention models
☆16Updated 4 years ago
Gitsamshi / WeakVRD-Captioning
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆32Updated 4 years ago
ruotianluo / coco-caption
☆67Updated 2 years ago
MILVLG / mt-captioning
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Updated 4 years ago
ussaema / SeqCapsGAN
Subjective Image Captioning using Capsule Generative Adversarial Network
☆11Updated 4 years ago
yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆58Updated last year
ezeli / Transformer_model
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12Updated 3 years ago
LibertFan / TCIC
TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning in IJCAI2021.
☆9Updated 3 years ago
luo3300612 / image-captioning-DLCT
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
☆200Updated 3 years ago
MILVLG / mmnas
Deep Multimodal Neural Architecture Search
☆28Updated 4 years ago
sks3i / pycocoevalcap
Microsoft COCO Caption Evaluation Tool - Python 3
☆33Updated 6 years ago
LibertFan / ImageCaption
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Updated 5 years ago
YiwuZhong / Sub-GC
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆97Updated 10 months ago
zhangxuying1004 / RSTNet
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Updated 2 years ago
GT-RIPL / Xmodal-Ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆59Updated 2 years ago
CrossmodalGroup / SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆51Updated 4 years ago
JDAI-CV / image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆274Updated 3 years ago
li-xirong / cross-lingual-cap
Cross-lingual image captioning
☆87Updated 3 years ago
luo3300612 / Transformer-Captioning
Optimized code based on M2 for faster image captioning training
☆21Updated 2 years ago
cshizhe / asg2cap
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆199Updated 2 years ago
Zhiquan-Wen / D-VQA
PyTorch implementation of "Debiased Visual Question Answering from Feature and Sample Perspectives" (NeurIPS 2021)
☆25Updated 2 years ago
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Updated 5 years ago
luo3300612 / Semantics-AssistedVideoCaptioning.pytorch
pytorch implementation of Semantics-AssistedVideoCaptioning
☆11Updated 2 years ago