iamirmasoud / image_captioning
Automatic Image Captioning using PyTorch on MS COCO dataset
☆19Updated 2 years ago
Alternatives and similar repositories for image_captioning:
Users that are interested in image_captioning are comparing it to the libraries listed below
- Simple implementation of OpenAI CLIP model in PyTorch.☆650Updated 9 months ago
- Explainability for Vision Transformers☆887Updated 2 years ago
- Paper implementations from scratch and machine learning tutorials☆344Updated last year
- Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning☆2,807Updated 2 years ago
- Personal short implementations of Machine Learning papers☆239Updated last year
- [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize …☆1,827Updated 11 months ago
- Video datasets☆1,288Updated last year
- Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)☆1,978Updated 2 years ago
- The Hitchhiker’s guide in getting a FAANG job (Computer Vision)☆87Updated 2 years ago
- Recent Transformer-based CV and related works.☆1,326Updated last year
- This is an ongoing project of designing a custom object detector from scratch. You can also use the pytorch-lightning training pipeline t…☆9Updated last year
- Image captioning model with Resnet50 encoder and LSTM decoder☆15Updated 4 months ago
- Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Pr…☆1,088Updated last year
- Tensorflow implementation of DETR : Object Detection with Transformers☆170Updated 2 years ago
- ☆12Updated last year
- Yolo to COCO annotation format converter☆286Updated last year
- Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.☆459Updated 2 years ago
- In-depth tutorials for implementing deep learning models on your own with PyTorch.☆1,526Updated last year
- A paper list of some recent Transformer-based CV works.☆1,172Updated this week
- Simple image captioning model☆1,335Updated 7 months ago
- Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch☆1,092Updated last year
- Fine-tune Facebook's DETR (DEtection TRansformer) on Colaboratory.☆142Updated last year
- [ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decode…☆814Updated last year
- 🗂 Split folders with files (i.e. images) into training, validation and test (dataset) folders☆417Updated last year
- A best practice for deep learning project template architecture.☆1,309Updated 5 years ago
- This repo implements and trains Vision Transformer (VIT) on a synthetically generated dataset which has colored mnist images on texture b…☆15Updated 11 months ago
- Simple image-captioning model using Flickr8K dataset☆14Updated 2 years ago
- View model summaries in PyTorch!☆2,662Updated this week
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,063Updated last month
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆160Updated 3 years ago