AmirhosseinHonardoust / Image-Captioning-CNN-LSTMLinks

An end-to-end image captioning project using a CNN encoder (ResNet-50) and LSTM decoder in PyTorch. Includes vocabulary building, preprocessing, training with BLEU evaluation, and inference. Generates natural language captions for images with saved metrics, model checkpoints, and visualization outputs.
โ˜†21Updated last month

Alternatives and similar repositories for Image-Captioning-CNN-LSTM

Users that are interested in Image-Captioning-CNN-LSTM are comparing it to the libraries listed below

Sorting: