Large-Scale Bidirectional Training for Zero-Shot Image Captioning
☆21 · Feb 14, 2023 · Updated 3 years ago
Alternatives and similar repositories for BITTERS
Users who are interested in BITTERS are comparing it to the libraries listed below.
- L-Verse: Bidirectional Generation Between Image and Text ☆107 · Apr 1, 2025 · Updated 11 months ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022 ☆29 · Dec 1, 2022 · Updated 3 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning ☆25 · Sep 4, 2020 · Updated 5 years ago
- ☆47 · Apr 29, 2024 · Updated last year
- EMNLP2021 - DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer ☆27 · Mar 21, 2022 · Updated 4 years ago
- Source code for "Recurrent Fusion Network for Image Captioning". ☆23 · Nov 24, 2018 · Updated 7 years ago
- A pytorch implementation of Attention Is All You Need (Transformer) for image captioning. ☆12 · Nov 15, 2021 · Updated 4 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning" ☆16 · Mar 15, 2021 · Updated 5 years ago
- Deliberate Attention Networks for Image Captioning (AAAI 2019) ☆11 · Sep 30, 2019 · Updated 6 years ago
- A Pytorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering' ☆10 · Jan 20, 2020 · Updated 6 years ago
- Yet Another Diffusion Automation ☆13 · Aug 21, 2022 · Updated 3 years ago
- ☆29 · Oct 18, 2022 · Updated 3 years ago
- PANENE: Progressive Approximate NEarest NEighbors ☆20 · Feb 12, 2025 · Updated last year
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching" ☆26 · Oct 20, 2022 · Updated 3 years ago
- Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning" ☆21 · Dec 26, 2016 · Updated 9 years ago
- ☆12 · Jul 21, 2022 · Updated 3 years ago
- Image Chinese Description Generation Based on Multi-level Selective Visual Semantic Attributes ☆16 · Nov 2, 2021 · Updated 4 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i… ☆14 · Jan 2, 2022 · Updated 4 years ago
- ☆18 · Nov 23, 2022 · Updated 3 years ago
- Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model ☆15 · Oct 13, 2022 · Updated 3 years ago
- Official code repo of SimMLM [ICCV 2025] ☆22 · Dec 1, 2025 · Updated 3 months ago
- [IJCAI-2022] Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks? ☆24 · Nov 19, 2024 · Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- [ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images ☆68 · Aug 5, 2022 · Updated 3 years ago
- code to help with tsne plotting ☆16 · May 19, 2020 · Updated 5 years ago
- ☆11 · Feb 11, 2020 · Updated 6 years ago
- ☆13 · Jan 8, 2020 · Updated 6 years ago
- ☆11 · Oct 16, 2023 · Updated 2 years ago
- Uncertainty-Guided Pseudo-Labelling with Model Averaging ☆11 · Updated this week
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs ☆17 · Mar 2, 2020 · Updated 6 years ago
- ☆11 · Dec 9, 2017 · Updated 8 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆15 · May 3, 2023 · Updated 2 years ago
- tf.data examples for Keras and estimator models ☆22 · Oct 2, 2018 · Updated 7 years ago
- PyTorch implementation of image captioning with adaptive attention mechanism. ☆18 · Mar 23, 2019 · Updated 6 years ago
- The pytorch implementation on “Fine-Grained Image Captioning with Global-Local Discriminative Objective” ☆21 · Oct 17, 2019 · Updated 6 years ago
- SuperStyleNet: Deep Image Synthesis with Superpixel Based Style Encoder (BMVC 2021) ☆27 · Dec 28, 2021 · Updated 4 years ago
- ☆25 · Nov 22, 2024 · Updated last year
- Word Embedding Annealing Using Sequence-to-sequence Model ☆16 · Dec 2, 2020 · Updated 5 years ago
- Jupyter notebook examples for EXAONE Atelier in AWS Marketplace ☆14 · Dec 8, 2023 · Updated 2 years ago