kalpesh22-21 / Image_Captioning_using_Hugging_Face

In this project, the Flickr8k dataset was used to train an image captioning model using Hugging Face Transformers.

☆9

Alternatives and similar repositories for Image_Captioning_using_Hugging_Face

Users who are interested in Image_Captioning_using_Hugging_Face are comparing it to the repositories listed below.

tanishqgautam / Image-Captioning
Implemented 3 different architectures to tackle the Image Caption problem, i.e., Merged Encoder-Decoder - Bahdanau Attention - Transformer…
☆40 Updated 4 years ago
aravindvarier / Image-Captioning-Pytorch
Hyperparameter analysis for Image Captioning using LSTMs and Transformers
☆26 Updated last year
VinitSR7 / Image-Caption-Generation
Image Captioning: Implementing the Neural Image Caption Generator
☆21 Updated 4 years ago
RoyalSkye / Image-Caption
Using LSTM or Transformer to solve Image Captioning in Pytorch
☆78 Updated 3 years ago
dksifoua / Neural-Image-Caption-Generator
In this project, I define and train an image-to-caption model that can produce descriptions for real-world images with the Flickr-8k dataset.
☆7 Updated last year
zarzouram / image_captioning_with_transformers
Pytorch implementation of image captioning using transformer-based model.
☆66 Updated 2 years ago
tbmoon / basic_vqa
Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)
☆95 Updated last year
Dantekk / Image-Captioning
Image Captioning using CNN and Transformer.
☆54 Updated 3 years ago
kaylode / caption-transformer
Image captioning with Transformer
☆14 Updated 3 years ago
aimagelab / meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
☆540 Updated 2 years ago
yikuan8 / Transformers-VQA
An implementation that applies pre-trained V+L models to downstream VQA tasks. Now supports: VisualBERT, LXMERT, and UNITER
☆164 Updated 2 years ago
abdelhadie-almalla / image_captioning
☆12 Updated last year
senadkurtisi / pytorch-image-captioning
Transformer & CNN Image Captioning model in PyTorch.
☆44 Updated 2 years ago
ussaema / SeqCapsGAN
Subjective Image Captioning using Capsule Generative Adversarial Network
☆11 Updated 4 years ago
ajamjoom / Image-Captions
BERT + Image Captioning
☆132 Updated 4 years ago
airsplay / py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
☆233 Updated 3 years ago
Shreyz-max / Video-Captioning
Video Captioning is an encoder-decoder model based on sequence-to-sequence learning
☆137 Updated last year
saahiluppal / catr
Image Captioning Using Transformer
☆268 Updated 3 years ago
RachanaJayaram / Cross-Attention-VizWiz-VQA
A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …
☆15 Updated last year
ntusteeian / VQA_CNN-LSTM
Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…
☆20 Updated 4 years ago
232525 / PureT
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
☆67 Updated last year
v-iashin / BMT
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
☆227 Updated 2 years ago
MiteshPuthran / Image-Caption-Generator
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, De…
☆87 Updated 5 years ago
avinashsai / BERT-Aspect
BERT Fine-tuning for Aspect Based Sentiment Analysis
☆28 Updated 2 years ago
nasib-ullah / video-captioning-models-in-Pytorch
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73 Updated last year
SubhamIO / Image-Captioning-using-Attention-Mechanism-Local-Attention-and-Global-Attention-
Implemented Image Captioning Model using both Local and Global Attention Techniques and API'fied the model using FLASK
☆26 Updated 5 years ago
terry-r123 / Awesome-Captioning
A curated list of Multimodal Captioning related research (including image captioning, video captioning, and text captioning)
☆109 Updated 3 years ago
anujshah1003 / VQA-Demo-GUI
This repository gives a GUI using PyQt4 for VQA demo using Keras Deep Learning Library. The VQA model is created using Pre-trained VGG-1…
☆46 Updated 4 years ago
salaniz / pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
☆321 Updated 11 months ago
JDAI-CV / image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆274 Updated 3 years ago