yurayli / image-caption-pytorchLinks

image captioning with flikr8k dataset

☆14

Alternatives and similar repositories for image-caption-pytorch

Users that are interested in image-caption-pytorch are comparing it to the libraries listed below

Sorting:

vatsalsaglani / MultiLabelClassifier
Multi-label Classification using PyTorch on the CelebA dataset.
☆25Updated 5 years ago
YuanEZhou / satic
☆26Updated 4 years ago
luo3300612 / Awesome-image-captioning
image captioning paper list
☆8Updated 5 years ago
The-AI-Summer / simclr
An education step by step implementation of SimCLR that accompanies the blogpost
☆32Updated 3 years ago
issamemari / pytorch-multilabel-balanced-sampler
PyTorch samplers that output roughly balanced batches with support for multilabel datasets
☆57Updated last year
gchhablani / multilingual-vqa
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆34Updated 3 years ago
facebookresearch / connect-caption-and-trace
A unified framework to jointly model images, text, and human attention traces.
☆78Updated 4 years ago
gchhablani / multilingual-image-captioning
☆44Updated 3 years ago
weiyx16 / CLIP-pytorch
A non-JIT version implementation / replication of CLIP of OpenAI in pytorch
☆34Updated 4 years ago
intersun / LightningDOT
source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT
☆72Updated 2 years ago
ankandrew / online-label-smoothing-pt
Implementation of Online Label Smoothing in PyTorch
☆94Updated 2 years ago
byeongjokim / VIPriors-Image-Classification-Challenge
☆17Updated 4 years ago
ttchengab / mixup
☆19Updated 4 years ago
suetAndTie / cycle-image-gan
☆28Updated 5 years ago
firesans / STRforIndicLanguages
PyTorch implementation of STR models for transfer learning in Indic Languages
☆16Updated 3 years ago
Kirill-Kravtsov / drophead-pytorch
An implementation of drophead regularization for pytorch transformers
☆19Updated 3 years ago
wtliao / ImageTransformer
Image Captioning through Image Transformer
☆40Updated 4 years ago
sayakpaul / Learnable-Image-Resizing
TF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).
☆53Updated 3 years ago
ntusteeian / VQA_CNN-LSTM
Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended ta…
☆20Updated 4 years ago
cardinalblue / clip-models-for-distillation
☆18Updated last year
jackryo / ricap
☆23Updated 6 years ago
ShihaoShao-GH / 1st-Place-Solution-in-Google-Universal-Image-Embedding
1st Place Solution in Google Universal Image Embedding
☆65Updated 2 years ago
usuyama / ePillID-benchmark
ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification (CVPR 2020 VL3)
☆87Updated 3 years ago
ucasligang / SimViT
[ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.
☆68Updated 2 years ago
lucidrains / cross-transformers-pytorch
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆53Updated 4 years ago
01BB01 / eBayChallenge
[FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.
☆26Updated 2 years ago
kamalkraj / Swin-Transformer-Serve
Deploy Swin Transformer using TorchServe
☆27Updated 3 years ago
multimodal / multimodal
A collection of multimodal datasets, and visual features for VQA and captionning in pytorch. Just run "pip install multimodal"
☆82Updated 3 years ago
ZFTurbo / 2nd-place-solution-for-VinBigData-Chest-X-ray-Abnormalities-Detection
Localization of thoracic abnormalities model based on VinBigData (top 1%)
☆45Updated 4 years ago
RoyalSkye / Image-Caption
Using LSTM or Transformer to solve Image Captioning in Pytorch
☆78Updated 3 years ago