yurayli / image-caption-pytorchLinks
image captioning with flikr8k dataset
☆14Updated 3 years ago
Alternatives and similar repositories for image-caption-pytorch
Users that are interested in image-caption-pytorch are comparing it to the libraries listed below
Sorting:
- Image Captioning Using Transformer☆268Updated 3 years ago
- A unified framework to jointly model images, text, and human attention traces.☆78Updated 4 years ago
- Image Captioning through Image Transformer☆40Updated 4 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- Hyperparameter analysis for Image Captioning using LSTMs and Transformers☆26Updated last year
- A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-…☆112Updated 3 years ago
- [kaggle] 3rd place solution☆32Updated 4 years ago
- 1st Place Solution in Google Universal Image Embedding☆67Updated 2 years ago
- [NeurIPS 2022] Official PyTorch implementation of Optimizing Relevance Maps of Vision Transformers Improves Robustness. This code allows …☆131Updated 2 years ago
- ☆46Updated 4 years ago
- PyTorch samplers that output roughly balanced batches with support for multilabel datasets☆57Updated last year
- Using LSTM or Transformer to solve Image Captioning in Pytorch☆78Updated 4 years ago
- ☆26Updated 4 years ago
- Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval☆57Updated 3 years ago
- Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…☆20Updated 5 years ago
- SimVLM ---SIMPLE VISUAL LANGUAGE MODEL PRETRAINING WITH WEAK SUPERVISION☆36Updated 2 years ago
- Implementing PolyLoss in Pytorch☆76Updated 3 years ago
- ☆60Updated 4 years ago
- Collection of tools to support submissions to the 3rd VIPriors workshop challenges☆69Updated 2 years ago
- A PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention☆84Updated 5 years ago
- source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT☆72Updated 2 years ago
- Localization of thoracic abnormalities model based on VinBigData (top 1%)☆45Updated 4 years ago
- ☆158Updated 3 years ago
- Code of Dense Relational Captioning☆69Updated 2 years ago
- Code for CVPR 2021 paper: Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning☆86Updated 4 years ago
- Multi-label Classification using PyTorch on the CelebA dataset.☆24Updated 5 years ago
- ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification (CVPR 2020 VL3)☆88Updated 3 years ago
- Easiest way of fine-tuning HuggingFace video classification models☆142Updated 2 years ago
- ☆146Updated 4 years ago
- CLIP (Contrastive Language–Image Pre-training) for Italian☆186Updated 2 years ago