Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,796Sep 25, 2023Updated 2 years ago
Alternatives and similar repositories for CogView
Users that are interested in CogView are comparing it to the libraries listed below
Sorting:
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆955Aug 3, 2022Updated 3 years ago
- Taming Transformers for High-Resolution Image Synthesis☆6,451Jul 30, 2024Updated last year
- Official implementation of VQ-Diffusion☆978Apr 17, 2024Updated last year
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,689Mar 8, 2024Updated 2 years ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,532Nov 4, 2025Updated 4 months ago
- A unified 3D Transformer Pipeline for visual synthesis☆2,810May 29, 2023Updated 2 years ago
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,123May 30, 2023Updated 2 years ago
- ☆1,195Sep 29, 2022Updated 3 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,557Apr 24, 2024Updated last year
- Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.☆2,653Oct 2, 2022Updated 3 years ago
- ☆7,318Jul 2, 2024Updated last year
- ☆97Feb 20, 2026Updated last month
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆584Jun 4, 2024Updated last year
- PyTorch package for the discrete VAE used for DALL·E.☆10,875Jan 31, 2024Updated 2 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,861Feb 18, 2026Updated last month
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,906Feb 29, 2024Updated 2 years ago
- [CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models☆866Mar 27, 2023Updated 2 years ago
- Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…☆282Nov 15, 2022Updated 3 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,476May 31, 2023Updated 2 years ago
- PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs☆634Aug 9, 2022Updated 3 years ago
- [SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets☆994Jun 24, 2024Updated last year
- GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)☆7,664Jul 25, 2023Updated 2 years ago
- [CVPR 2021] Pytorch implementation for TediGAN: Text-Guided Diverse Face Image Generation and Manipulation☆391Mar 13, 2023Updated 3 years ago
- ☆485Jun 30, 2022Updated 3 years ago
- v objective diffusion inference code for PyTorch.☆718Nov 29, 2022Updated 3 years ago
- ☆3,051Feb 27, 2023Updated 3 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 4 years ago
- Generate images from texts. In Russian☆1,649Jan 10, 2023Updated 3 years ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,380Oct 19, 2025Updated 5 months ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,694Mar 3, 2026Updated 2 weeks ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- [CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing☆782Oct 3, 2023Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆337Aug 9, 2022Updated 3 years ago
- An open source implementation of CLIP.☆13,528Mar 12, 2026Updated last week
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,189Nov 18, 2024Updated last year
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,046Jan 23, 2026Updated last month
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,336Aug 10, 2023Updated 2 years ago
- ☆1,074Sep 18, 2024Updated last year