Image Captioning with CNN, LSTM and RNN using PyTorch on COCO Dataset
☆17 · Mar 8, 2020 · Updated 6 years ago
Alternatives and similar repositories for image_captioning
Users that are interested in image_captioning are comparing it to the libraries listed below.
- ☆16 · Dec 7, 2022 · Updated 3 years ago
- ☆11 · Mar 24, 2023 · Updated 3 years ago
- Repo of the paper "Generative Adversarial Networks as an advanced data augmentation technique for MRI data" by Filippos Konidaris, Thanos… ☆23 · Sep 24, 2021 · Updated 4 years ago
- A collection of publications on code models that look beyond accuracy. ☆13 · Jun 30, 2023 · Updated 2 years ago
- U-KAN: U-Shape Kolmogorov-Arnold Network for Image Registration ☆17 · May 26, 2024 · Updated last year
- Machine Learning Operations with a denoising diffusion model using a butterfly dataset ☆11 · Jun 2, 2024 · Updated last year
- A PyTorch-based library to evaluate learning methods on small image classification datasets ☆16 · Jun 22, 2022 · Updated 3 years ago
- Game recommendation engine built with React and PyTorch. Facebook Developer Circle Hackathon Local Language Winner. ☆17 · Jun 22, 2021 · Updated 4 years ago
- A Python tool to visualize the global distribution of your academic citations. ☆24 · Nov 24, 2025 · Updated 4 months ago
- Machine Learning and Deep Learning models for Anomaly Detection ☆10 · Mar 10, 2019 · Updated 7 years ago
- Implementation of CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphi ☆15 · Aug 4, 2021 · Updated 4 years ago
- This repository supports the BENDER series of videos for the MICCAI Education Challenge, 2022. ☆13 · Oct 6, 2025 · Updated 5 months ago
- [CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations" ☆55 · Jul 12, 2025 · Updated 8 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning ☆37 · May 19, 2023 · Updated 2 years ago
- A collection of resources on bioimage analysis and related tools and techniques ☆14 · Apr 28, 2023 · Updated 2 years ago
- Learning tutorial for machine learning beginners ☆16 · May 14, 2022 · Updated 3 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆43 · Feb 12, 2025 · Updated last year
- This is for the papers I review every week ☆14 · May 20, 2020 · Updated 5 years ago
- Eagleeye: fast sub-net evaluation for efficient neural network pruning with TensorFlow Keras ☆18 · Mar 27, 2021 · Updated 4 years ago
- 3 Minutes Machine Learning ☆16 · Dec 8, 2022 · Updated 3 years ago
- Image captioning with the Flickr8k dataset ☆14 · Dec 7, 2021 · Updated 4 years ago
- [ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments ☆43 · Feb 9, 2026 · Updated last month
- Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING ☆31 · Jun 1, 2022 · Updated 3 years ago
- This method performs 3D object detection in the BEV space using images from multiple cameras. ☆32 · Oct 26, 2022 · Updated 3 years ago
- 🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025 ☆36 · Aug 24, 2025 · Updated 7 months ago
- Papers from our SoK on Red-Teaming (Accepted at TMLR) ☆42 · Mar 16, 2026 · Updated last week
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆32 · Dec 26, 2024 · Updated last year
- CIRCLe: Color Invariant Representation Learning for Unbiased Classification of Skin Lesions ☆19 · Apr 17, 2023 · Updated 2 years ago
- [ECCV24] "Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning" by Chongyu Fan*, Jiancheng Liu*, Alfred Hero, … ☆25 · May 27, 2025 · Updated 9 months ago
- A personal academic website built on top of a Google Sheet document that is super easy to maintain. ☆23 · Dec 10, 2022 · Updated 3 years ago
- Notes about Computer vision and implementation of image-processing, face-detection, face-recognition, and character optical recognition a… ☆17 · Sep 5, 2022 · Updated 3 years ago
- Recommender for suggesting letter writers 👍 ☆34 · Aug 12, 2024 · Updated last year
- Person Detection using the EfficientNet B0 and Light Head RCNN running at 12 FPS ☆24 · Sep 20, 2019 · Updated 6 years ago
- [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion m… ☆67 · Jun 11, 2024 · Updated last year
- ☆27 · Nov 9, 2022 · Updated 3 years ago
- Sample integration with Deepgram and FastAPI ☆30 · Nov 26, 2025 · Updated 3 months ago
- 【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? ☆258 · Nov 29, 2024 · Updated last year
- The MCG black-box attack framework published in TPAMI 2022 ☆37 · Jan 17, 2023 · Updated 3 years ago
- Production Machine Learning Pipeline for Text Classification with fastText ☆33 · Jun 10, 2021 · Updated 4 years ago