luo3300612 / Awesome-image-captioning
image captioning paper list
☆8Updated 5 years ago
Related projects:
- ☆9Updated last year
- ☆25Updated 3 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Updated 3 years ago
- ☆19Updated last year
- Official PyTorch implementation of "Semantic Diversity Learning for Zero-Shot Multi-label Classification" (ICCV 2021)☆30Updated 2 years ago
- ☆42Updated 3 years ago
- Awesome video-based self-supervised learning methods from recent years☆10Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- Phrase Localization Evaluation Toolkit☆19Updated 5 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆13Updated 3 years ago
- ☆11Updated last year
- Video captioning on MSR-VTT Dataset☆12Updated 3 years ago
- Code for our NeurIPS 2020 paper "Auto Learning Attention" (coming soon)☆21Updated 3 years ago
- Gender/age attribute grounding in a weakly supervised manner.☆12Updated 5 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 2 years ago
- SMCA replication☆21Updated 3 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 3 years ago
- ☆23Updated 2 years ago
- ☆11Updated 4 years ago
- A Simple Framework for CV Pre-training Models (SOCO, VirTex, BEiT)☆15Updated 2 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆13Updated 3 years ago
- Code for the paper "Learning Hierarchy Aware Features for Reducing Mistake Severity", accepted at ECCV 2022☆14Updated last year
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆46Updated 3 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆27Updated last year
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN☆20Updated 3 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆16Updated 2 years ago
- ☆29Updated 11 months ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆23Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 2 years ago