SatyamGaba / image_captioning
Image Captioning with CNN, LSTM and RNN using PyTorch on COCO Dataset
☆16Updated 5 years ago
Alternatives and similar repositories for image_captioning
Users that are interested in image_captioning are comparing it to the libraries listed below
Sorting:
- Image Captioning Vision Transformers (ViTs) are transformer models that generate descriptive captions for images by combining the power o…☆36Updated 7 months ago
- An unofficial implementation of the CVPR 2020 paper Multimodal Categorization of Crisis Events in Social Media☆16Updated 3 years ago
- Pytorch implementation of image captioning using transformer-based model.☆66Updated 2 years ago
- Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformer…☆41Updated 4 years ago
- Image Captioning using CNN+RNN Encoder-Decoder Architecture in PyTorch☆23Updated 4 years ago
- Simple image-captioning model using Flickr8K dataset☆15Updated 3 years ago
- Transformer & CNN Image Captioning model in PyTorch.☆43Updated 2 years ago
- Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]☆67Updated 11 months ago
- PyTorch port of models for Visual Sentiment Analysis pre-trained on the T4SA dataset.☆42Updated 6 months ago
- Squeeze and Excitation network implementation.☆18Updated 5 years ago
- CNN LSTM architecture implemented in Pytorch for Video Classification☆283Updated 2 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆191Updated 2 years ago
- Attention-Based Convolutional Neural Network for Weakly Labeled Human Activities’ Recognition With Wearable Sensors☆12Updated 4 years ago
- Image Captioning using CNN and Transformer.☆52Updated 3 years ago
- Kaggle RSNA Pneumonia Detection Challenge☆15Updated 6 years ago
- ☆66Updated 4 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i…☆41Updated 2 years ago
- Image Captioning Using Transformer☆268Updated 2 years ago
- Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING☆30Updated 2 years ago
- Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"☆213Updated 2 years ago
- Deep Neural Networks for Video Classification☆48Updated 2 years ago
- Using LSTM or Transformer to solve Image Captioning in Pytorch☆76Updated 3 years ago
- Hyperparameter analysis for Image Captioning using LSTMs and Transformers☆26Updated last year
- Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"☆70Updated 5 months ago
- Paper implementation☆14Updated 4 years ago
- SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation☆108Updated last year
- Implement Human Activity Recognition in PyTorch using hybrid of LSTM, Bi-dir LSTM and Residual Network Models☆15Updated 5 years ago
- Solves a kaggle problem of State Farm Distracted Driver Detection☆54Updated 9 months ago
- Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)☆96Updated last year
- In this repository, a simple implementation of Video augmentation is provided to augment videos for machine learning training tasks.☆21Updated 5 months ago