imSanko / Image_Caption_Generator_With_TransformersView on GitHub
This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.
12Sep 2, 2024Updated last year

Alternatives and similar repositories for Image_Caption_Generator_With_Transformers

Users that are interested in Image_Caption_Generator_With_Transformers are comparing it to the libraries listed below

Sorting:

Are these results useful?