bearcatt/LaBERT

A length-controllable and non-autoregressive image captioning model.

☆69

Alternatives and similar repositories for LaBERT

Users who are interested in LaBERT are comparing it to the repositories listed below.

visinf / cos-cvae
Diverse Image Captioning with Context-Object Split Latent Spaces (NeurIPS 2020)
☆37 · May 16, 2022 · Updated 4 years ago
bladewaltz1 / ModeCap
Controllable image captioning model with unsupervised modes
☆21 · Apr 14, 2023 · Updated 3 years ago
princetonvisualai / SPICE-U
☆11 · Sep 7, 2020 · Updated 5 years ago
LuoweiZhou / VLP
Vision-Language Pre-training for Image Captioning and Question Answering
☆420 · Jan 18, 2022 · Updated 4 years ago
chihyaoma / cyclical-visual-captioning
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46 · Jul 29, 2020 · Updated 5 years ago
Gitsamshi / WeakVRD-Captioning
Implementation of paper "Improving Image Captioning with Better Use of Caption"
☆33 · Sep 15, 2020 · Updated 5 years ago
jayleicn / recurrent-transformer
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170 · Dec 4, 2020 · Updated 5 years ago
yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI 2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57 · Oct 22, 2023 · Updated 2 years ago
forence / Awesome-Visual-Captioning
This repository focuses on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP
☆410 · Nov 14, 2022 · Updated 3 years ago
syuqings / video-paragraph
Code for the paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021
☆66 · Oct 21, 2021 · Updated 4 years ago
feizc / PNAIC
Partially Non-Autoregressive Image Captioning
☆10 · Sep 30, 2021 · Updated 4 years ago
YuanEZhou / Grounded-Image-Captioning
☆64 · Jan 5, 2022 · Updated 4 years ago
fawazsammani / show-edit-tell
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
☆82 · Jul 17, 2020 · Updated 6 years ago
cshizhe / asg2cap
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., …
☆200 · Dec 1, 2022 · Updated 3 years ago
njucckevin / KnowCap
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
☆13 · Feb 15, 2024 · Updated 2 years ago
tgc1997 / RMN
IJCAI 2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 · Nov 23, 2020 · Updated 5 years ago
JDAI-CV / image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆273 · Jul 27, 2021 · Updated 4 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26 · Oct 20, 2022 · Updated 3 years ago
yangxuntu / SGAE
☆218 · Feb 26, 2022 · Updated 4 years ago
LuoweiZhou / detectron-vlp
Detectron for image/video region feature extraction, inspired by Xinlei's repo
☆22 · Nov 21, 2020 · Updated 5 years ago
lukemelas / image-paragraph-captioning
[EMNLP 2018] Training for Diversity in Image Paragraph Captioning
☆91 · Sep 12, 2019 · Updated 6 years ago
qingzwang / DiversityMetrics
This is the implementation of self-CIDEr and LSA-based diversity metrics (only for Python 2.7).
☆37 · Feb 26, 2022 · Updated 4 years ago
jssprz / attentive_specialized_network_video_captioning
Source code of the paper titled "Attentive Visual Semantic Specialized Network for Video Captioning"
☆15 · Apr 6, 2021 · Updated 5 years ago
aimagelab / show-control-and-tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
☆281 · Dec 21, 2022 · Updated 3 years ago
aimagelab / meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
☆546 · Dec 21, 2022 · Updated 3 years ago
husthuaan / AAT
Code for paper "Adaptively Aligned Image Captioning via Adaptive Attention Time". NeurIPS 2019
☆50 · Dec 18, 2019 · Updated 6 years ago
zinengtang / VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
☆56 · Feb 6, 2023 · Updated 3 years ago
yiyang92 / vae_captioning
Implementation of Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space
☆60 · Apr 5, 2018 · Updated 8 years ago
yj-yu / lsmdc
☆33 · Nov 12, 2018 · Updated 7 years ago
szq0214 / MSR-VTT-Challenge
Video to Language Challenge (MSR-VTT Challenge 2016)
☆32 · Dec 28, 2017 · Updated 8 years ago
alasdairtran / transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
☆93 · Apr 19, 2024 · Updated 2 years ago
tgGuo15 / PriorImageCaption
☆30 · Oct 2, 2018 · Updated 7 years ago
kdexd / virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
☆561 · Aug 22, 2025 · Updated 11 months ago
fenglinliu98 / MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
☆65 · Oct 19, 2020 · Updated 5 years ago
vsislab / Controllable_XGating
ICCV 2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
☆68 · Nov 19, 2019 · Updated 6 years ago
fengyang0317 / unsupervised_captioning
Code for Unsupervised Image Captioning
☆223 · Mar 24, 2023 · Updated 3 years ago
eric-xw / Video-guided-Machine-Translation
Starter code for the VMT task and challenge
☆51 · Jul 29, 2020 · Updated 5 years ago
husthuaan / AoANet
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
☆339 · May 2, 2021 · Updated 5 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27 · Aug 20, 2022 · Updated 3 years ago