senadkurtisi/pytorch-image-captioning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/senadkurtisi/pytorch-image-captioning)

senadkurtisi / pytorch-image-captioning

Transformer & CNN Image Captioning model in PyTorch.

☆44

Alternatives and similar repositories for pytorch-image-captioning

Users that are interested in pytorch-image-captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aravindvarier / Image-Captioning-Pytorch
View on GitHub
Hyperparameter analysis for Image Captioning using LSTMs and Transformers
☆26Oct 3, 2023Updated 2 years ago
senadkurtisi / IMDB-Sentiment-Analysis-PyTorch
View on GitHub
Neural Network for classifying movie reviews as positive/negative using IMDB dataset
☆12Feb 2, 2021Updated 5 years ago
tatwan / image-captioning-pytorch
View on GitHub
Image Captioning using CNN+RNN Encoder-Decoder Architecture in PyTorch
☆24Feb 9, 2021Updated 5 years ago
milkymap / transformer-image-captioning
View on GitHub
Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING
☆30Jun 1, 2022Updated 4 years ago
UbiBelETF / dagger
View on GitHub
A fully-featured, modern game engine made for educational purposes.
☆11Feb 28, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
senadkurtisi / pytorch-GCN
View on GitHub
PyTorch implementation of the Graph Convolutional Network by Kipf et al.
☆29Feb 25, 2021Updated 5 years ago
yuanxiaosc / Image-Captioning
View on GitHub
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述
☆36Jun 30, 2019Updated 7 years ago
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
pranoyr / scene-graph-vit
View on GitHub
Implementation of the Paper Scene-Graph ViT
☆10Dec 20, 2024Updated last year
HJYao00 / R1-ShareVL
View on GitHub
[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward
☆38Sep 19, 2025Updated 10 months ago
gordicaleksa / gordicaleksa
View on GitHub
GitHub's new feature: repo with the same name as your GitHub name initialized with README.md will show on your landing page!
☆12May 30, 2026Updated last month
Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
View on GitHub
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 3 years ago
MathGaron / pytorch_toolbox
View on GitHub
Boiler plate code for pytorch. Train/Validation loops, visualization etc. For research.
☆11Jul 25, 2024Updated 2 years ago
luguoqing / Diff-FSPM
View on GitHub
Mining Frequent Sequential Patterns under Differential Privacy
☆16May 22, 2014Updated 12 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kgkgzrtk / cUNet-Pytorch
View on GitHub
conditional U-Net Pytorch Implementation
☆11Jul 27, 2021Updated 5 years ago
rdisipio / gpt-q
View on GitHub
Quantum-enhanced GPT-2
☆15Mar 19, 2024Updated 2 years ago
w5688414 / EfficientNet-ViolenceDetection
View on GitHub
an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM
☆11Jun 1, 2020Updated 6 years ago
shenmishajing / lifted
View on GitHub
☆16Jul 25, 2024Updated 2 years ago
mhauskn / pytorch_attention
View on GitHub
Pytorch implementation of Bahdanau attention.
☆13Oct 13, 2020Updated 5 years ago
matfax / spmf
View on GitHub
Fork of the SPMF Open-Source Data Mining Library from Prof. Philippe Fournier-Viger
☆17Jun 27, 2026Updated last month
Subangkar / Image-Captioning-Attention-PyTorch
View on GitHub
An attention based sequential deep learning model implemented in pytorch to generate single line caption given an input image
☆11Dec 29, 2020Updated 5 years ago
makarovartyom / Image-Captioning-with-Attention
View on GitHub
Repository contains Python code for image pre-processing and captioning with Deep learning model
☆15Dec 8, 2020Updated 5 years ago
fujiso / SODA
View on GitHub
SODA: Story Oriented Dense Video Captioning Evaluation Framework
☆14May 3, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ledormeurduval / PrefixSpan
View on GitHub
Sequential Pattern Mining - PrefixSpan - Fitting the implementation of Tianming Lu with the Spark MLlib Python API
☆13Oct 15, 2016Updated 9 years ago
particle1331 / lizard-torch-course
View on GitHub
Notebooks for the PyTorch course by @deeplizard.
☆16Dec 3, 2019Updated 6 years ago
adesgautam / clip-search
View on GitHub
A search engine implementation using OpenAI's clip model
☆10Jun 20, 2021Updated 5 years ago
calisolo / Levels_image_captioning_NICE
View on GitHub
NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU
☆11Jun 22, 2023Updated 3 years ago
Luo-Z13 / GLH-Bridge-page
View on GitHub
[TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery
☆15Mar 18, 2025Updated last year
taseikyo / PremiereSubtitle
View on GitHub
Premiere subtitles generator | Pr 字幕批量生成器
☆26Oct 14, 2019Updated 6 years ago
AmritK10 / Image_Captioning
View on GitHub
Image Captioning using LSTM and Deep Learning on Flickr8K dataset.
☆15Feb 1, 2022Updated 4 years ago
filick / GRU-RCN
View on GitHub
Video classification using convGRU
☆13Feb 15, 2018Updated 8 years ago
senadkurtisi / Multivariate-Time-Series-Forecast
View on GitHub
Forecast of the level of pollution in the next hour in Beijing based on historical information
☆17Nov 12, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kyegomez / AudioMamba
View on GitHub
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15Updated this week
n-gauhar / 3D-bedroom
View on GitHub
A bedroom designed using OpenGL (gl, glu, glut libraries). Lighting is applied. No texture is applied.
☆14Jun 18, 2021Updated 5 years ago
KatherLab / prompt_injection_attacks
View on GitHub
☆14Dec 28, 2024Updated last year
CharlesGong12 / FreeInpaint
View on GitHub
[AAAI 2026] FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
☆18Dec 30, 2025Updated 6 months ago
randombenj / detectron2onnx-inference
View on GitHub
Export [detectron2](https://github.com/facebookresearch/detectron2) model to [onnx](https://github.com/onnx/onnx) and run inference using…
☆18Mar 18, 2021Updated 5 years ago
daveredrum / image-captioning
View on GitHub
Image captioning models "show and tell" + "show, attend and tell" in PyTorch
☆19Jul 19, 2018Updated 8 years ago
ibliever / Cross-modal-information-fusion-for-voice-spoofing-detection
View on GitHub
This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"
☆13Jun 5, 2023Updated 3 years ago