guanghuixu/AnchorCaptioner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guanghuixu/AnchorCaptioner)

guanghuixu / AnchorCaptioner

☆30

Alternatives and similar repositories for AnchorCaptioner

Users that are interested in AnchorCaptioner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

guanghuixu / CRN_tvqa
View on GitHub
☆15Oct 27, 2020Updated 5 years ago
ronghanghu / mmf
View on GitHub
A modular framework for Visual Question Answering research by the FAIR A-STAR team
☆45Aug 26, 2021Updated 4 years ago
ZephyrZhuQi / ssbaseline
View on GitHub
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]
☆57Apr 5, 2022Updated 4 years ago
microsoft / TAP
View on GitHub
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
☆72May 22, 2023Updated 3 years ago
yashkant / sam-textvqa
View on GitHub
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
☆65Sep 15, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Gitsamshi / WeakVRD-Captioning
View on GitHub
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33Sep 15, 2020Updated 5 years ago
ChenyuGAO-CS / SMA
View on GitHub
The imdb files with SBD-Trans OCR for TextVQA dataset.
☆11Nov 30, 2021Updated 4 years ago
entalent / MemCap
View on GitHub
code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`
☆11Mar 17, 2020Updated 6 years ago
CUMTGG / CIIC
View on GitHub
☆18Sep 13, 2023Updated 2 years ago
AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
husthuaan / AoANet
View on GitHub
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
☆339May 2, 2021Updated 5 years ago
husthuaan / AAT
View on GitHub
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50Dec 18, 2019Updated 6 years ago
HAWLYQ / Qc-TextCap
View on GitHub
☆16Dec 25, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luo3300612 / image-captioning-DLCT
View on GitHub
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
☆203Jun 8, 2022Updated 4 years ago
YiwuZhong / Sub-GC
View on GitHub
[ECCV 2020] Official code for "Comprehensive Image Captioning via Scene Graph Decomposition"
☆99Aug 20, 2024Updated last year
ruotianluo / DiscCaptioning
View on GitHub
Code for Discriminability objective for training descriptive captions(CVPR 2018)
☆109Nov 21, 2019Updated 6 years ago
GT-RIPL / Xmodal-Ctx
View on GitHub
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆61Oct 21, 2022Updated 3 years ago
yangxuntu / SGAE
View on GitHub
☆218Feb 26, 2022Updated 4 years ago
cshizhe / asg2cap
View on GitHub
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200Dec 1, 2022Updated 3 years ago
LibertFan / ImageCaption
View on GitHub
Bridging by Word: Image-Grounded Vocabulary Construction for Visual Captioning based in ACL2019
☆17Sep 8, 2019Updated 6 years ago
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
facebookresearch / grid-feats-vqa
View on GitHub
Grid features pre-training code for visual question answering
☆269Sep 17, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
232525 / PureT
View on GitHub
Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]
☆70Jun 1, 2024Updated 2 years ago
CurryYuan / X-Trans2Cap
View on GitHub
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
☆36Aug 26, 2022Updated 3 years ago
andreineculai / MPC
View on GitHub
☆25May 11, 2022Updated 4 years ago
jacobswan1 / ViTCAP
View on GitHub
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
☆43May 28, 2022Updated 4 years ago
aimagelab / camel
View on GitHub
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
☆30Dec 1, 2022Updated 3 years ago
zhigao2017 / CurvatureGeneration_FSL
View on GitHub
☆13Sep 14, 2022Updated 3 years ago
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Oct 30, 2024Updated last year
yangbang18 / Non-Autoregressive-Video-Captioning
View on GitHub
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57Oct 22, 2023Updated 2 years ago
ecom-research / ComposeAE
View on GitHub
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
☆56Oct 8, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lyh983012 / SNN-genunit
View on GitHub
developing tools for LIAF-SNNs and LIF-SNNs
☆10Sep 14, 2022Updated 3 years ago
huaifeng1993 / NMS
View on GitHub
the speed of nms
☆20Jul 5, 2018Updated 8 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
syuqings / video-paragraph
View on GitHub
Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66Oct 21, 2021Updated 4 years ago
frostinassiky / bsp
View on GitHub
Placeholder for code of BSP.
☆11Aug 13, 2021Updated 4 years ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
hedongxiao-tju / NSLM
View on GitHub
Code & data accompanying the paper ["Unveiling Implicit Deceptive Patterns in Multi-modal Fake News via Neuro-Symbolic Reasoning"].
☆13Dec 21, 2023Updated 2 years ago