zai-org / CogViewLinks
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,797Updated 2 years ago
Alternatives and similar repositories for CogView
Users that are interested in CogView are comparing it to the libraries listed below
Sorting:
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆956Updated 3 years ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,461Updated 2 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,549Updated last year
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.☆1,108Updated last year
- A unified 3D Transformer Pipeline for visual synthesis☆2,809Updated 2 years ago
- ☆1,587Updated 3 years ago
- Deep Learning Examples☆828Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆549Updated 2 years ago
- ☆3,423Updated last year
- ☆3,044Updated 2 years ago
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,049Updated last year
- OpenAI CLIP text encoders for multiple languages!☆824Updated 2 years ago
- ☆1,190Updated 3 years ago
- Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartooniza…☆824Updated 2 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,334Updated 2 years ago
- ☆1,033Updated 2 years ago
- Taming Transformers for High-Resolution Image Synthesis☆6,385Updated last year
- [TOG 2022] SofGAN: A Portrait Image Generator with Dynamic Styling☆773Updated 6 months ago
- ☆1,479Updated last year
- Pretrained Dalle2 from laion☆505Updated 2 years ago
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,157Updated 2 years ago
- Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。☆4,150Updated last year
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆791Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,631Updated last year
- [CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer☆1,691Updated 2 years ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,562Updated 2 years ago
- GLM (General Language Model)☆3,376Updated 2 years ago
- Code for Text2Human (SIGGRAPH 2022). Paper: Text2Human: Text-Driven Controllable Human Image Generation☆854Updated last year
- Official PyTorch repo for JoJoGAN: One Shot Face Stylization☆1,439Updated 3 years ago
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion