THUDM / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,783Updated last year
Alternatives and similar repositories for CogView:
Users that are interested in CogView are comparing it to the libraries listed below
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆951Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆547Updated 2 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,501Updated last year
- A unified 3D Transformer Pipeline for visual synthesis☆2,811Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,415Updated last year
- Taming Transformers for High-Resolution Image Synthesis☆6,149Updated 9 months ago
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆883Updated last year
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,224Updated 9 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆771Updated 9 months ago
- Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation☆844Updated 9 months ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,331Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,612Updated last year
- ☆1,571Updated 2 years ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆888Updated 2 years ago
- Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744☆923Updated 9 months ago
- Pretrained Dalle2 from laion☆503Updated 2 years ago
- [TOG 2022] SofGAN: A Portrait Image Generator with Dynamic Styling☆773Updated last year
- Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartooniza…☆807Updated last year
- OpenAI CLIP text encoders for multiple languages!☆796Updated last year
- ☆895Updated 10 months ago
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,044Updated last year
- ☆3,296Updated 11 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,328Updated last year
- Open-Set Grounded Text-to-Image Generation☆2,115Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,546Updated last year
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆709Updated last year
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,077Updated 4 months ago
- StyleGAN-Human: A Data-Centric Odyssey of Human Generation☆1,176Updated 3 months ago
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,083Updated last year
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,308Updated last year