Image Captioning with CNN, LSTM and RNN using PyTorch on COCO Dataset
☆17Mar 8, 2020Updated 6 years ago
Alternatives and similar repositories for image_captioning
Users who are interested in image_captioning are comparing it to the libraries listed below.
- ☆16Dec 7, 2022Updated 3 years ago
- Using a CNN-LSTM hybrid network to generate captions for images☆18Nov 19, 2019Updated 6 years ago
- ☆13Feb 6, 2025Updated last year
- A collection of publications on code models that look beyond accuracy.☆13Jun 30, 2023Updated 2 years ago
- White-box Fairness Testing through Adversarial Sampling☆14Apr 16, 2021Updated 4 years ago
- ☆16Mar 8, 2024Updated 2 years ago
- Machine Learning Operations with a denoising diffusion model using a butterfly dataset☆11Jun 2, 2024Updated last year
- U-KAN: U-Shape Kolmogorov-Arnold Network for Image Registration☆17May 26, 2024Updated last year
- Image Captioning Model Implemented in PyTorch using CNN followed by LSTM☆13Apr 5, 2018Updated 8 years ago
- Game recommendation engine built with React and PyTorch. Facebook Developer Circle Hackathon Local Language Winner.☆17Jun 22, 2021Updated 4 years ago
- Implementation of MobileViT in TensorFlow and Keras☆13Nov 16, 2022Updated 3 years ago
- Using Keras 2.2.4 to prune VGG16☆12Jan 4, 2019Updated 7 years ago
- Implementation of CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphi☆15Aug 4, 2021Updated 4 years ago
- Code for "Towards Interpretable Skin Lesion Classification with Deep Learning Models"☆12Jan 20, 2021Updated 5 years ago
- [CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations"☆58Jul 12, 2025Updated 9 months ago
- The official SpeakerVid-5M data curation code.☆71Jul 23, 2025Updated 8 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆43Jun 9, 2024Updated last year
- A collection of resources on bioimage analysis and related tools and techniques☆14Apr 28, 2023Updated 2 years ago
- KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate a sequence of facial landmarks f…☆30Oct 28, 2025Updated 5 months ago
- This is for the papers I review every week☆14May 20, 2020Updated 5 years ago
- Eagleeye: fast sub-net evaluation for efficient neural network pruning with Tensorflow keras☆18Mar 27, 2021Updated 5 years ago
- 3 Minutes Machine Learning☆16Dec 8, 2022Updated 3 years ago
- ☆30Aug 14, 2023Updated 2 years ago
- Image captioning with the Flickr8k dataset☆14Dec 7, 2021Updated 4 years ago
- ☆17May 23, 2023Updated 2 years ago
- Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING☆31Jun 1, 2022Updated 3 years ago
- [ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments☆47Feb 9, 2026Updated 2 months ago
- This method performs 3D object detection in the BEV space using images from multiple cameras.☆32Oct 26, 2022Updated 3 years ago
- Papers from our SoK on Red-Teaming (Accepted at TMLR)☆42Mar 24, 2026Updated 3 weeks ago
- Deploy Python FastAPI serverless application on Azure Functions☆21May 31, 2024Updated last year
- ☆54Jan 17, 2026Updated 2 months ago
- An LSTM template and a few examples using Vivado HLS☆47May 4, 2024Updated last year
- A personal academic website built on top of a Google Sheet document that is super easy to maintain.☆23Dec 10, 2022Updated 3 years ago
- Undergraduate Dissertation: Content-based video retrieval prototype for movies written in Python using OpenCV.☆16Jul 28, 2023Updated 2 years ago
- Notes about Computer vision and implementation of image-processing, face-detection, face-recognition, and character optical recognition a…☆17Sep 5, 2022Updated 3 years ago
- Recommender for suggesting letter writers 👍☆34Aug 12, 2024Updated last year
- This project is focused on the Deployment phase of machine learning. The Docker and FastAPI are used to deploy a dockerized server of tra…☆27Jan 7, 2023Updated 3 years ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval☆65Jun 19, 2024Updated last year
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m…☆67Jun 11, 2024Updated last year