A neural network architecture(CNN+LSTM) that automatically generates captions from the images. The model uses ResNet architecture to train the Encoder while DecoderRNN has to be trained with our choice of trainable parameters. I have trained the model on the Microsoft Common Objects in COntext (MS COCO) dataset and have tested the network on fic…
☆25Jan 13, 2020Updated 6 years ago
Alternatives and similar repositories for Automatic-Image-Captioning
Users that are interested in Automatic-Image-Captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- ☆10Apr 20, 2018Updated 8 years ago
- Image Caption workout with NIC and NBT☆16Apr 5, 2019Updated 7 years ago
- Adversarial Machine Translation with pytorch☆23Jan 14, 2018Updated 8 years ago
- VQA - Visual Question Answering☆14Nov 13, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Aug 6, 2018Updated 7 years ago
- Probabilistic line search algorithm for stochastic optimization with a TensorFlow interface.☆21Jul 27, 2017Updated 8 years ago
- ☆22Oct 14, 2019Updated 6 years ago
- Reimplementation of ECCV paper "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis" with PyTorch Library.☆38Apr 7, 2022Updated 4 years ago
- Application of rnn-gan to machine translation☆18Jun 5, 2019Updated 7 years ago
- A Test-Implementation of the IMPALA algorithm (by deepmind 2018)☆35Mar 16, 2018Updated 8 years ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- Repository for studying distributional rl☆30Feb 2, 2025Updated last year
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- custom object detection tutorial with tensorflow object detection api☆22May 19, 2018Updated 8 years ago
- Implementation of 3D reconstruction from accidental motion, CVPR 2014☆12Dec 8, 2022Updated 3 years ago
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…☆13May 29, 2021Updated 5 years ago
- https://github.com/mitsuba-renderer/mitsuba2 in docker☆10Jun 13, 2020Updated 5 years ago
- Pytorch implementation of audio-visual fusion video captioning model☆27Jul 26, 2018Updated 7 years ago
- ☆14Oct 24, 2023Updated 2 years ago
- ☆10Jul 27, 2021Updated 4 years ago
- tensorflow object detection api helper tool ( custom object detection )☆30May 20, 2018Updated 8 years ago
- ☆10Apr 11, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Video clipping by face recognition☆28Dec 10, 2018Updated 7 years ago
- Code for the DataPipes article☆15Jun 14, 2022Updated 3 years ago
- Reading list for research topics in multimodal machine learning☆10Mar 14, 2023Updated 3 years ago
- RNN语义分割+KinectFusion=3 d Semantic Scene☆16Mar 11, 2018Updated 8 years ago
- ☆21Apr 12, 2022Updated 4 years ago
- BERT + Image Captioning☆135Jan 8, 2021Updated 5 years ago
- Implementations of the XNOR networks☆12Aug 9, 2017Updated 8 years ago
- Imagenet Pretraining for Covid-19 Xray Identification☆10Apr 5, 2020Updated 6 years ago
- Examples of Generative Adversarial Networks built using torchgan☆12Jun 11, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- fish eye correct☆19Mar 20, 2015Updated 11 years ago
- ☆17Mar 23, 2023Updated 3 years ago
- Code for ICLR 2019 Paper, "MAX-MIG: AN INFORMATION THEORETIC APPROACH FOR JOINT LEARNING FROM CROWDS"☆25Jun 6, 2023Updated 3 years ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆40Nov 26, 2025Updated 6 months ago
- [ECCV2022] "Identity-Aware Hand Mesh Estimation and Personalization from RGB Images".☆44Mar 5, 2023Updated 3 years ago
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras☆13Oct 3, 2016Updated 9 years ago