luo3300612 / Awesome-image-captioning
image captioning paper list
☆8Updated 5 years ago
Alternatives and similar repositories for Awesome-image-captioning:
Users that are interested in Awesome-image-captioning are comparing it to the libraries listed below
- ☆19Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- ☆9Updated 2 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Updated 4 years ago
- ☆11Updated 4 years ago
- ☆26Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- SMCA replication☆21Updated 3 years ago
- ☆15Updated 3 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- awesome video-based self-supervised learning methods in recently years☆9Updated 4 years ago
- Code for the Paper Learning Hierarchy Aware Features for Reducing Mistake Severity, accepted in ECCV 2022☆15Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆23Updated 3 years ago
- ☆19Updated 3 years ago
- A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021☆31Updated 3 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8☆27Updated 3 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆10Updated 4 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- Semi-supervised learning with Grad-CAM consistency regularization☆10Updated 3 years ago
- Bag of MLP☆20Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆13Updated 4 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Updated 4 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- Rethinking Nearest Neighbors for Visual Classification☆31Updated 3 years ago
- ☆19Updated 3 years ago
- Phrase Localization Evaluation Toolkit☆20Updated 5 years ago
- ☆10Updated 2 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆22Updated 3 years ago