YoadTew / zero-shot-image-to-text
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
☆276 Updated 2 years ago
Alternatives and similar repositories for zero-shot-image-to-text
Users who are interested in zero-shot-image-to-text are comparing it to the repositories listed below.
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings) ☆195 Updated last year
- CLIPScore EMNLP code ☆223 Updated 2 years ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022) ☆182 Updated 2 years ago
- Easy-to-use, efficient code for extracting OpenAI CLIP (Global/Grid) features from images and text. ☆128 Updated 5 months ago
- [ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383 ☆411 Updated 2 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆108 Updated 2 months ago
- Language Models Can See: Plugging Visual Controls in Text Generation ☆256 Updated 3 years ago
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022) ☆192 Updated 2 years ago
- ☆97 Updated 7 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning