MIMICLab/BITTERS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MIMICLab/BITTERS)

MIMICLab / BITTERS

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

☆21

Alternatives and similar repositories for BITTERS

Users that are interested in BITTERS are comparing it to the libraries listed below

Sorting:

MIMICLab / L-Verse
View on GitHub
L-Verse: Bidirectional Generation Between Image and Text
☆107Apr 1, 2025Updated 11 months ago
aimagelab / camel
View on GitHub
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
☆29Dec 1, 2022Updated 3 years ago
MILVLG / mt-captioning
View on GitHub
A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning
☆25Sep 4, 2020Updated 5 years ago
kakaobrain / noc
View on GitHub
☆47Apr 29, 2024Updated last year
haozheji / DiscoDVT
View on GitHub
EMNLP2021 - DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
☆27Mar 21, 2022Updated 4 years ago
cswhjiang / Recurrent_Fusion_Network
View on GitHub
Source code for "Recurrent Fusion Network for Image Captioning".
☆23Nov 24, 2018Updated 7 years ago
ezeli / Transformer_model
View on GitHub
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12Nov 15, 2021Updated 4 years ago
chenxy99 / SD-FSIC
View on GitHub
Official code for the paper "Self-Distillation for Few-Shot Image Captioning"
☆16Mar 15, 2021Updated 5 years ago
fkxssaa / Deliberate-Attention-Networks-for-Image-Captioning
View on GitHub
Deliberate Attention Networks for Image Captioning (AAAI 2019)
☆11Sep 30, 2019Updated 6 years ago
ayouboumani / image-captioning-with-attention
View on GitHub
A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering'
☆10Jan 20, 2020Updated 6 years ago
Sxela / YADA
View on GitHub
Yet Another Diffusion Automation
☆13Aug 21, 2022Updated 3 years ago
mlfoundations / clip_quality_not_quantity
View on GitHub
☆29Oct 18, 2022Updated 3 years ago
e- / PANENE
View on GitHub
PANENE: Progressive Approximate NEarest NEighbors
☆20Feb 12, 2025Updated last year
ShiYaya / emscore
View on GitHub
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26Oct 20, 2022Updated 3 years ago
wenhuchen / Semi-Supervised-Image-Captioning
View on GitHub
Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"
☆21Dec 26, 2016Updated 9 years ago
kojima-takeshi188 / CFA
View on GitHub
☆12Jul 21, 2022Updated 3 years ago
ShemoonX / Chinese-image-caption
View on GitHub
Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes
☆16Nov 2, 2021Updated 4 years ago
LAION-AI / LAION-PEOPLE
View on GitHub
This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…
☆14Jan 2, 2022Updated 4 years ago
hiteshK03 / Remote-sensing-image-captioning-with-transformer-and-multilabel-classification
View on GitHub
☆18Nov 23, 2022Updated 3 years ago
akjayant / Image-Captioning-via-YOLOv5-EncoderDecoderwithAttention
View on GitHub
Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model
☆15Oct 13, 2022Updated 3 years ago
LezJ / SimMLM
View on GitHub
Official code repo of SimMLM [ICCV 2025]
☆22Dec 1, 2025Updated 3 months ago
hichoe95 / Artifact-Detection-and-Sequential-Ablation
View on GitHub
[IJCAI-2022] Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?
☆24Nov 19, 2024Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 2 years ago
hichoe95 / Rarity-Score
View on GitHub
[ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images
☆68Aug 5, 2022Updated 3 years ago
chaddy1004 / tSNE-helper
View on GitHub
code to help with tsne plotting
☆16May 19, 2020Updated 5 years ago
psoulos / role-decomposition
View on GitHub
☆11Feb 11, 2020Updated 6 years ago
nabihach / IDA
View on GitHub
☆13Jan 8, 2020Updated 6 years ago
dongjunKANG / VIM
View on GitHub
☆11Oct 16, 2023Updated 2 years ago
AiMl-hub / UPLM
View on GitHub
Uncertainty-Guided Pseudo-Labelling with Model Averaging
☆11Updated this week
hannandarryl / ManyModalQA
View on GitHub
Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs
☆17Mar 2, 2020Updated 6 years ago
zeran4 / TechtreeAI
View on GitHub
☆11Dec 9, 2017Updated 8 years ago
MikeWangWZHL / Zemi
View on GitHub
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15May 3, 2023Updated 2 years ago
terryum / tf.data-for-keras-and-tensorflow-estimator
View on GitHub
tf.data examples for Keras and estimator models
☆22Oct 2, 2018Updated 7 years ago
s1879281 / Image-Captioning-with-Adaptive-Attention
View on GitHub
PyTorch implementation of image captioning with adaptive attention mechanism.
☆18Mar 23, 2019Updated 6 years ago
WuJie1010 / Fine-Grained-Image-Captioning
View on GitHub
The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective”
☆21Oct 17, 2019Updated 6 years ago
BenjaminJonghyun / SuperStyleNet
View on GitHub
SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder (BMVC 2021)
☆27Dec 28, 2021Updated 4 years ago
ldynx / SAVE
View on GitHub
☆25Nov 22, 2024Updated last year
tgisaturday / Seq2CNN
View on GitHub
Word Embedding Annealing Using Sequence-to-sequence Model
☆16Dec 2, 2020Updated 5 years ago
LGAI-Research / EXAONE-Atelier
View on GitHub
Jupyter notebook examples for EXAONE Atelier in AWS Marketplace
☆14Dec 8, 2023Updated 2 years ago