THUDM / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,753Updated last year
Alternatives and similar repositories for CogView:
Users that are interested in CogView are comparing it to the libraries listed below
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆948Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆547Updated 2 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,397Updated last year
- A unified 3D Transformer Pipeline for visual synthesis☆2,807Updated last year
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,583Updated 11 months ago
- Taming Transformers for High-Resolution Image Synthesis☆6,007Updated 6 months ago
- ☆3,222Updated 9 months ago
- Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartooniza…☆797Updated last year
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆833Updated last year
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,327Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆885Updated 11 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆763Updated 6 months ago
- ☆1,465Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,599Updated last year
- Pretrained Dalle2 from laion☆500Updated last year
- ☆1,030Updated last year
- Official implementation of VQ-Diffusion☆914Updated 10 months ago
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,132Updated last year
- Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744☆914Updated 6 months ago
- ☆2,956Updated last year
- Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts☆4,523Updated 5 months ago
- [CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer☆1,655Updated last year
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,051Updated last year
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,951Updated 9 months ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,461Updated 9 months ago
- Open-Set Grounded Text-to-Image Generation☆2,076Updated 11 months ago
- ☆1,166Updated 2 years ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,054Updated last month
- Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation☆837Updated 6 months ago
- [TOG 2022] SofGAN: A Portrait Image Generator with Dynamic Styling☆769Updated last year