luo3300612 / Awesome-image-captioningLinks
image captioning paper list
☆8Updated 5 years ago
Alternatives and similar repositories for Awesome-image-captioning
Users that are interested in Awesome-image-captioning are comparing it to the libraries listed below
Sorting:
- ☆19Updated 2 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Updated 4 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Updated 2 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN☆21Updated 3 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- ☆11Updated 4 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Updated 4 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆23Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆14Updated 4 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆22Updated 4 years ago
- SMCA replication☆21Updated 3 years ago
- awesome video-based self-supervised learning methods in recently years☆9Updated 4 years ago
- ☆9Updated 2 years ago
- ☆26Updated 4 years ago
- ☆42Updated 4 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 3 years ago
- ☆20Updated 4 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Updated 3 years ago
- ☆15Updated 3 years ago
- A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021☆31Updated 3 years ago
- "Exploring Vision Transformers for Fine-grained Classification" at CVPRW FGVC8☆27Updated 4 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 4 years ago
- ☆20Updated 3 years ago
- [AAAI'20] Code release for "HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs".☆38Updated last year
- Image captioning with weight pruning in PyTorch☆22Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆11Updated 3 years ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten…☆12Updated 4 years ago
- Semi-supervised learning with Grad-CAM consistency regularization☆10Updated 3 years ago