jsoft88 / cptr-vision-transformerLinks

Implementation of the CPTR model by https://arxiv.org/pdf/2101.10804.pdf

☆11

Alternatives and similar repositories for cptr-vision-transformer

Users that are interested in cptr-vision-transformer are comparing it to the libraries listed below

Sorting:

zarzouram / image_captioning_with_transformers
Pytorch implementation of image captioning using transformer-based model.
☆66Updated 2 years ago
RoyalSkye / Image-Caption
Using LSTM or Transformer to solve Image Captioning in Pytorch
☆78Updated 3 years ago
krasserm / fairseq-image-captioning
Transformer-based image captioning extension for pytorch/fairseq
☆317Updated 4 years ago
saahiluppal / catr
Image Captioning Using Transformer
☆268Updated 3 years ago
milkymap / transformer-image-captioning
Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING
☆30Updated 3 years ago
senadkurtisi / pytorch-image-captioning
Transformer & CNN Image Captioning model in PyTorch.
☆44Updated 2 years ago
aimagelab / meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
☆538Updated 2 years ago
tanishqgautam / Image-Captioning
Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformer…
☆40Updated 4 years ago
ajamjoom / Image-Captions
BERT + Image Captioning
☆133Updated 4 years ago
Dantekk / Image-Captioning
Image Captioning using CNN and Transformer.
☆53Updated 3 years ago
gchhablani / multilingual-vqa
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆34Updated 3 years ago
davidnvq / grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆193Updated 2 years ago
LuoweiZhou / VLP
Vision-Language Pre-training for Image Captioning and Question Answering
☆419Updated 3 years ago
232525 / PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
☆67Updated last year
yikuan8 / Transformers-VQA
An implementation that downstreams pre-trained V+L models to VQA tasks. Now support: VisualBERT, LXMERT, and UNITER
☆164Updated 2 years ago
SjokerLily / awesome-image-captioning
A paper list of image captioning.
☆22Updated 3 years ago
airsplay / py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
☆233Updated 3 years ago
pzzhang / VinVL
project page for VinVL
☆355Updated last year
suicao / ai4code-baseline
Early solution for Google AI4Code competition
☆76Updated 3 years ago
JDAI-CV / image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆274Updated 3 years ago
aravindvarier / Image-Captioning-Pytorch
Hyperparameter analysis for Image Captioning using LSTMs and Transformers
☆26Updated last year
YoadTew / zero-shot-image-to-text
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
☆276Updated 2 years ago
BierOne / bottom-up-attention-vqa
An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…
☆36Updated 3 years ago
tbmoon / basic_vqa
Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)
☆95Updated last year
DavidHuji / CapDec
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
☆197Updated last year
j-min / VL-T5
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
☆371Updated last year
yahoo / object_relation_transformer
Implementation of the Object Relation Transformer for Image Captioning
☆178Updated 9 months ago
jchenghu / ExpansionNet_v2
Implementation code of the work "Exploiting Multiple Sequence Lengths in Fast End to End Training for Image Captioning"
☆92Updated 6 months ago
shreydan / VisionGPT2
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
☆43Updated last year
luo3300612 / image-captioning-DLCT
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
☆200Updated 3 years ago