Generate text captions for images from their embeddings.
☆119 · Aug 1, 2023 · Updated 2 years ago
Alternatives and similar repositories for clip-text-decoder
Users that are interested in clip-text-decoder are comparing it to the libraries listed below.
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning ☆138 · Mar 16, 2023 · Updated 3 years ago
- Image Captioning using a combination of object detection via YOLOv5 and Encoder Decoder LSTM model ☆15 · Oct 13, 2022 · Updated 3 years ago
- [ECCV2022] Source Code for "Improving GANs for Long-Tailed Data through Group Spectral Regularization" ☆16 · Oct 2, 2022 · Updated 3 years ago
- babyLM WhisBERT code ☆19 · May 27, 2024 · Updated last year
- A PyTorch implementation of Attention Is All You Need (Transformer) for image captioning. ☆12 · Nov 15, 2021 · Updated 4 years ago
- ☆11 · Oct 2, 2024 · Updated last year
- A PyTorch implementation of the paper 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering' ☆10 · Jan 20, 2020 · Updated 6 years ago
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic ☆278 · Sep 17, 2022 · Updated 3 years ago
- Implementation of paper https://arxiv.org/abs/2210.04559 ☆57 · Nov 26, 2025 · Updated 3 months ago
- ☆12 · Sep 19, 2021 · Updated 4 years ago
- t-vMF Similarity for Regularizing Intra-Class Feature Distribution ☆21 · Jun 11, 2021 · Updated 4 years ago
- Large-Scale Bidirectional Training for Zero-Shot Image Captioning ☆21 · Feb 14, 2023 · Updated 3 years ago
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions ☆51 · May 26, 2023 · Updated 2 years ago
- DDSP-FM: a differentiable FM synth based on Magenta's DDSP library. ☆19 · Jun 14, 2021 · Updated 4 years ago
- Simple image captioning model ☆1,413 · Jun 9, 2024 · Updated last year
- WildLife Documentary Dataset ☆14 · Jun 19, 2017 · Updated 8 years ago
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach. ☆31 · Jan 7, 2025 · Updated last year
- Using LLMs and pre-trained caption models for super-human performance on image captioning. ☆42 · Oct 13, 2023 · Updated 2 years ago
- Official implementation of "Perturbed-Attention Guidance" ☆60 · Jul 2, 2024 · Updated last year
- [BMVC2024] Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning ☆14 · Feb 14, 2026 · Updated last month
- Codes and scripts for "Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning" ☆20 · Mar 23, 2022 · Updated 4 years ago
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI ☆11 · Mar 3, 2024 · Updated 2 years ago
- InstructionGPT-4 ☆42 · Dec 29, 2023 · Updated 2 years ago
- Official implementation of OSSGAN [CVPR 2022] ☆21 · May 2, 2022 · Updated 3 years ago
- ☆16 · Jan 3, 2023 · Updated 3 years ago
- ☆22 · Sep 13, 2021 · Updated 4 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning" ☆18 · Mar 15, 2021 · Updated 5 years ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models" ☆76 · Jun 6, 2023 · Updated 2 years ago
- A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis ☆17 · Nov 16, 2023 · Updated 2 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs. ☆100 · Mar 11, 2023 · Updated 3 years ago
- Torch implementation of orthoreg. ☆15 · Oct 27, 2021 · Updated 4 years ago
- Implementation for NATv2. ☆23 · Feb 20, 2021 · Updated 5 years ago
- ☆16 · May 7, 2023 · Updated 2 years ago
- BEAR: a new BEnchmark on video Action Recognition ☆46 · Apr 21, 2024 · Updated last year
- Deliberate Attention Networks for Image Captioning (AAAI 2019) ☆11 · Sep 30, 2019 · Updated 6 years ago
- Implementation for CVPR2021 paper "Joint Generative and Contrastive Learning for Unsupervised Person Re-identification" ☆48 · Dec 6, 2021 · Updated 4 years ago
- Demos of neural image editing ☆11 · Mar 15, 2021 · Updated 5 years ago
- A repo for shared Jupyter and Colab notebooks ☆23 · Jul 4, 2025 · Updated 8 months ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching". ☆14 · Sep 1, 2022 · Updated 3 years ago