zai-org / CogViewLinks
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,792Updated 2 years ago
Alternatives and similar repositories for CogView
Users that are interested in CogView are comparing it to the libraries listed below
Sorting:
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆955Updated 3 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,437Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆550Updated 2 years ago
- ☆1,580Updated 3 years ago
- Deep Learning Examples☆827Updated 11 months ago
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,089Updated 9 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆781Updated last year
- A unified 3D Transformer Pipeline for visual synthesis☆2,807Updated 2 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,536Updated last year
- Taming Transformers for High-Resolution Image Synthesis☆6,324Updated last year
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,335Updated 2 years ago
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,981Updated last year
- ☆3,036Updated 2 years ago
- [CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer☆1,687Updated 2 years ago
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆956Updated last year
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆912Updated last year
- ☆1,479Updated last year
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,117Updated 2 years ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch☆1,346Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,629Updated last year
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,661Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,308Updated last year
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,514Updated last year
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,044Updated last year
- ☆1,185Updated 3 years ago
- Pretrained Dalle2 from laion☆502Updated 2 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,653Updated last month
- GLM (General Language Model)☆3,313Updated last year
- ☆1,051Updated last year
- ☆3,392Updated last year