luo3300612 / Awesome-image-captioning
image captioning paper list
☆8Updated 5 years ago
Alternatives and similar repositories for Awesome-image-captioning:
Users that are interested in Awesome-image-captioning are comparing it to the libraries listed below
- ☆19Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- ☆26Updated 3 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- ☆9Updated 2 years ago
- ☆42Updated 3 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features☆12Updated 4 years ago
- SMCA replication☆21Updated 3 years ago
- awesome video-based self-supervised learning methods in recently years☆9Updated 4 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Updated 3 years ago
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…☆18Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 3 years ago
- 🏆 The 2nd Place Submission to the CVPR2021-Evoked Emotion from Videos challenge.☆17Updated 3 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- 2021 AAAI Modular Graph Transformer Networks for Multi-Label Image Classification; Official GitHub: https://github.com/ReML-AI/MGTN☆21Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Updated 4 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆13Updated 3 years ago
- ☆31Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆23Updated 3 years ago
- ☆11Updated 4 years ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- PIC Challenge Baseline☆19Updated 6 years ago
- ☆19Updated 3 years ago
- ☆35Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021☆31Updated 3 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago