YuanEZhou/Grounded-Image-Captioning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YuanEZhou/Grounded-Image-Captioning)

YuanEZhou / Grounded-Image-Captioning

☆64

Alternatives and similar repositories for Grounded-Image-Captioning

Users that are interested in Grounded-Image-Captioning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
fawazsammani / show-edit-tell
View on GitHub
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆82Jul 17, 2020Updated 6 years ago
JDAI-CV / image-captioning
View on GitHub
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆273Jul 27, 2021Updated 4 years ago
PluviophileYU / CVC-QA
View on GitHub
Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"
☆14Oct 13, 2020Updated 5 years ago
sibeiyang / sgmn
View on GitHub
Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.
☆117Aug 10, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
malihealikhani / Cross-modal_Coherence_Modeling
View on GitHub
Cross-modal Coherence Modeling for Caption Generation
☆11Jul 24, 2020Updated 6 years ago
cshizhe / asg2cap
View on GitHub
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200Dec 1, 2022Updated 3 years ago
hassanhub / MultiGrounding
View on GitHub
This is the repo for Multi-level textual grounding
☆34Jul 21, 2020Updated 6 years ago
yangxuntu / catt
View on GitHub
☆12Mar 8, 2021Updated 5 years ago
airsplay / py-bottom-up-attention
View on GitHub
PyTorch bottom-up attention with Detectron2
☆239Jan 4, 2022Updated 4 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
daqingliu / CAVP
View on GitHub
Code release for Context-Aware Visual Policy Network for Sequence-Level Image Captioning (MM 2018) and Context-Aware Visual Policy Networ…
☆46Jul 27, 2019Updated 6 years ago
fenglinliu98 / MIA
View on GitHub
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" （NeurIPS 2019）
☆65Oct 19, 2020Updated 5 years ago
yangxuntu / SGAE
View on GitHub
☆218Feb 26, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
husthuaan / AoANet
View on GitHub
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
☆339May 2, 2021Updated 5 years ago
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
fawazsammani / look-and-modify
View on GitHub
Look and Modify: Modification Networks for Image Captioning, BMVC 2019
☆21Feb 18, 2020Updated 6 years ago
Wangt-CN / VC-R-CNN
View on GitHub
[CVPR 2020] The official pytorch implementation of ``Visual Commonsense R-CNN''
☆357May 2, 2021Updated 5 years ago
aimagelab / show-control-and-tell
View on GitHub
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
☆281Dec 21, 2022Updated 3 years ago
LuoweiZhou / VLP
View on GitHub
Vision-Language Pre-training for Image Captioning and Question Answering
☆420Jan 18, 2022Updated 4 years ago
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
jiasenlu / NeuralBabyTalk
View on GitHub
Pytorch code of for our CVPR 2018 paper "Neural Baby Talk"
☆525Mar 27, 2019Updated 7 years ago
qingzwang / DiversityMetrics
View on GitHub
This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).
☆37Feb 26, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bearcatt / LaBERT
View on GitHub
A length-controllable and non-autoregressive image captioning model.
☆69Jun 10, 2021Updated 5 years ago
VALUE-Leaderboard / DataRelease
View on GitHub
Data Release for VALUE Benchmark
☆30Feb 16, 2022Updated 4 years ago
facebookresearch / connect-caption-and-trace
View on GitHub
A unified framework to jointly model images, text, and human attention traces.
☆80May 24, 2021Updated 5 years ago
entalent / MemCap
View on GitHub
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Mar 17, 2020Updated 6 years ago
daqingliu / NMTree
View on GitHub
Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)
☆38Nov 23, 2019Updated 6 years ago
SHTUPLUS / vsub
View on GitHub
The substitution of qsub.
☆12Jan 25, 2019Updated 7 years ago
alasdairtran / transform-and-tell
View on GitHub
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
☆93Apr 19, 2024Updated 2 years ago
ccvl / iep-ref
View on GitHub
Inferring and Executing Programs for Visual Reasoning
☆21Jan 4, 2019Updated 7 years ago
YiwuZhong / Sub-GC
View on GitHub
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆99Aug 20, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
delchiaro / RATT
View on GitHub
☆18Oct 3, 2023Updated 2 years ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
yahoo / object_relation_transformer
View on GitHub
Implementation of the Object Relation Transformer for Image Captioning
☆180Sep 17, 2024Updated last year
BigRedT / info-ground
View on GitHub
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73Aug 22, 2020Updated 5 years ago
ck0123 / improved-bertscore-for-image-captioning-evaluation
View on GitHub
☆21Jul 25, 2024Updated 2 years ago
gsig / visual-grounding
View on GitHub
Project page for "Visual Grounding in Video for Unsupervised Word Translation" CVPR 2020
☆43Apr 26, 2020Updated 6 years ago
MILVLG / bottom-up-attention.pytorch
View on GitHub
A PyTorch reimplementation of bottom-up-attention models
☆302Apr 7, 2022Updated 4 years ago