Pranshu258 / Deep_Image_Captioning
Deep Reinforcement Learning based Image Captioning with Embedding Reward
☆26Updated last month
Related projects: ⓘ
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles.☆36Updated 2 years ago
- Microsoft COCO Caption Evaluation Tool - Python 3☆30Updated 5 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Updated last year
- Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)☆64Updated 3 years ago
- Unpaired Image Captioning☆35Updated 3 years ago
- Code of Dense Relational Captioning☆67Updated last year
- Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …☆60Updated last year
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆24Updated 4 years ago
- Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)☆44Updated last year
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21Updated last year
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆13Updated 9 months ago
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'☆10Updated 4 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network☆13Updated 3 years ago
- A reading list of papers about Visual Question Answering.☆31Updated 2 years ago
- A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.☆46Updated 2 years ago
- ☆39Updated last year
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆26Updated 3 years ago
- Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)☆119Updated last year
- Compact Trilinear Interaction for Visual Question Answering (ICCV 2019)☆38Updated last year
- a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD☆14Updated 2 years ago
- A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)☆143Updated 3 years ago
- [ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"☆94Updated last month
- Implementation of paper "Improving Image Captioning with Better Use of Caption"☆32Updated 4 years ago
- Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020☆81Updated 4 years ago
- ☆26Updated last year
- Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019☆49Updated 4 years ago
- A simplified pytorch version of densecap☆39Updated last year
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆115Updated 2 years ago
- Phrase Localization Evaluation Toolkit☆19Updated 5 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆80Updated 2 years ago