CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
☆145Jun 10, 2022Updated 3 years ago
Alternatives and similar repositories for clip-gen
Users that are interested in clip-gen are comparing it to the libraries listed below
Sorting:
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Dreamfusion with Stable diffusion backend☆10Oct 4, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆16Oct 19, 2022Updated 3 years ago
- A variant on ashawkey/stable-dreamfusion, operating in latent space☆72Oct 8, 2022Updated 3 years ago
- ☆484Jun 30, 2022Updated 3 years ago
- Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)☆183Mar 23, 2023Updated 2 years ago
- ☆24Mar 30, 2024Updated last year
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- Implementation UniTune based on stable diffusion☆40Nov 15, 2022Updated 3 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Text-writing denoising diffusion (and much more)☆30May 14, 2023Updated 2 years ago
- Implements VQGAN+CLIP for image and video generation, and style transfers, based on text and image prompts. Emphasis on ease-of-use, docu…☆112Feb 11, 2022Updated 4 years ago
- ☆17Dec 28, 2023Updated 2 years ago
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆867Mar 27, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- ☆150Oct 19, 2023Updated 2 years ago
- Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.☆181Aug 5, 2022Updated 3 years ago
- ☆157Jan 20, 2023Updated 3 years ago
- BigGAN-AM improves the sample diversity of BigGAN and synthesizes Places365 images.☆20Oct 3, 2023Updated 2 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition" (CVPR 2022)☆323Jul 19, 2022Updated 3 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- ☆354May 10, 2022Updated 3 years ago
- [NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks☆53May 14, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- ☆64May 23, 2022Updated 3 years ago
- Modeling Artistic Workflows for Image Generation and Editing (ECCV 2020)☆90Sep 24, 2020Updated 5 years ago
- stylegan3_blending☆39Dec 1, 2021Updated 4 years ago
- ☆39Jul 20, 2022Updated 3 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Image and video processing toolbox☆10Jun 12, 2020Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆13Dec 16, 2024Updated last year
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆337Aug 9, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- ☆25Mar 31, 2022Updated 3 years ago