terry-r123 / Awesome-CaptioningView external linksLinks
A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
☆113Jun 6, 2022Updated 3 years ago
Alternatives and similar repositories for Awesome-Captioning
Users that are interested in Awesome-Captioning are comparing it to the libraries listed below
Sorting:
- This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP☆411Nov 14, 2022Updated 3 years ago
- ☆24Apr 4, 2022Updated 3 years ago
- A paper list of image captioning.☆22Apr 23, 2022Updated 3 years ago
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".☆57Oct 22, 2023Updated 2 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Oct 20, 2022Updated 3 years ago
- A curated list of image captioning and related area resources. :-)☆1,074Mar 28, 2023Updated 2 years ago
- ☆26Oct 20, 2021Updated 4 years ago
- A curated list of research papers in Video Captioning☆121Jan 5, 2021Updated 5 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆123Dec 17, 2022Updated 3 years ago
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆228Jan 3, 2024Updated 2 years ago
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …☆61Oct 21, 2022Updated 3 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- ☆218Feb 26, 2022Updated 3 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆65Jul 29, 2025Updated 6 months ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆54Jul 9, 2021Updated 4 years ago
- Grid features pre-training code for visual question answering☆269Sep 17, 2021Updated 4 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆48Nov 15, 2021Updated 4 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- [ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset☆90Sep 6, 2023Updated 2 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*☆30Apr 16, 2021Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Jul 17, 2020Updated 5 years ago
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆33Sep 15, 2020Updated 5 years ago
- A curated list of zero-shot captioning papers☆24Aug 26, 2023Updated 2 years ago
- ☆14May 10, 2021Updated 4 years ago
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆50Sep 30, 2022Updated 3 years ago
- ☆85Dec 4, 2022Updated 3 years ago
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- ☆12Mar 21, 2024Updated last year
- Pytorch Implementation of MUCKO(2020 IJCAI)☆20Oct 25, 2020Updated 5 years ago
- The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆12Aug 19, 2021Updated 4 years ago
- Code for Unsupervised Image Captioning☆221Mar 24, 2023Updated 2 years ago
- Optimized code based on M2 for faster image captioning training☆21Nov 18, 2022Updated 3 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 5 years ago
- Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".☆43May 28, 2022Updated 3 years ago
- Code for paper "Attention on Attention for Image Captioning". ICCV 2019☆339May 2, 2021Updated 4 years ago
- [ICCV 2023] Accurate and Fast Compressed Video Captioning☆52Jul 28, 2025Updated 6 months ago