ksheersaagr / Automatic-Image-Captioning
A neural network architecture(CNN+LSTM) that automatically generates captions from the images. The model uses ResNet architecture to train the Encoder while DecoderRNN has to be trained with our choice of trainable parameters. I have trained the model on the Microsoft Common Objects in COntext (MS COCO) dataset and have tested the network on fic…
☆25Updated 5 years ago
Alternatives and similar repositories for Automatic-Image-Captioning:
Users that are interested in Automatic-Image-Captioning are comparing it to the libraries listed below
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 3 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆75Updated 5 years ago
- Image Captioning with Keras☆63Updated 4 years ago
- BERT + Image Captioning☆132Updated 4 years ago
- Neural Machine Translation with Attention (PyTorch)☆45Updated 6 years ago
- PyTorch Tutorial on google colaboratory.☆76Updated 5 years ago
- A Pytorch tutorial for implementation of Dynamic memory Network Plus☆64Updated 6 years ago
- Convolutional Neural Networks for Visual Recognition☆18Updated 7 years ago
- ☆71Updated 6 years ago
- A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention☆85Updated 5 years ago
- Machine Translation using Transfromers☆29Updated 5 years ago
- PyTorch implementation of DRAW: A Recurrent Neural Network For Image Generation trained on Devanagari dataset.☆89Updated 4 years ago
- Code for our paper: *Shamsian, *Kleinfeld, Globerson & Chechik, "Learning Object Permanence from Video"☆68Updated 5 months ago
- Example of a Cover letter for AI Residency☆80Updated 5 years ago
- code for running trained model from Visual Reasoning by Progressive Module Networks (ICLR19)☆15Updated 6 years ago
- PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind P…☆60Updated 6 years ago
- This repository contains the tensorflow implementation and models for DAN - CVPR 2017 paper☆22Updated 6 years ago
- Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge☆79Updated 6 years ago
- An easy-to-use app to visualise attentions of various VQA models.☆41Updated 2 years ago
- Some notes, tutorials, and some experimentation with the fast.ai library (https://github.com/fastai/fastai)☆58Updated 6 years ago
- Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".☆75Updated last year
- Interface for people to use my model which predicts which techniques one should use to solve a competitive programming problem to get an …☆147Updated 5 years ago
- Implementation of modern data augmentation techniques in TensorFlow 2.x to be used in your training pipeline.☆34Updated 4 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15Updated 3 years ago
- Starter code in PyTorch for the Visual Dialog challenge☆192Updated 2 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆32Updated 5 years ago
- ☆37Updated 7 years ago
- making use of (Language model + Image model) to generate captions on flickr images. CNN + LSTM + Transfer learning☆20Updated 7 years ago
- Official Tensorflow implementation of drl-RPN: Deep Reinforcement Learning of Region Proposal Networks (CVPR 2018 paper)☆79Updated last year
- generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset☆78Updated 6 years ago