THUDM / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,719 · Updated last year
Related projects
Alternatives and complementary repositories for CogView
- Official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers" ☆949 · Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆544 · Updated last year
- Taming Transformers for High-Resolution Image Synthesis ☆5,792 · Updated 3 months ago
- [TOG 2022] SofGAN: A Portrait Image Generator with Dynamic Styling ☆764 · Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,368 · Updated last year
- A unified 3D Transformer Pipeline for visual synthesis ☆2,809 · Updated last year
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch ☆750 · Updated 3 months ago
- ☆1,546 · Updated 2 years ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,317 · Updated last year
- [CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer ☆1,642 · Updated last year
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L… ☆2,418 · Updated 6 months ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch ☆1,250 · Updated 6 months ago
- OpenAI CLIP text encoders for multiple languages! ☆760 · Updated last year
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,210 · Updated 2 years ago
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch ☆1,918 · Updated 6 months ago
- Karras et al. (2022) diffusion models for PyTorch ☆2,313 · Updated 3 months ago
- Deep Learning Examples ☆808 · Updated 3 weeks ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers ☆689 · Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion ☆1,212 · Updated 3 months ago
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral) ☆3,997 · Updated last year
- ☆3,123 · Updated 5 months ago
- GLIDE: a diffusion-based text-conditional image synthesis model ☆3,541 · Updated 8 months ago
- Simple image captioning model ☆1,313 · Updated 5 months ago
- ☆1,157 · Updated 2 years ago
- [SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets ☆964 · Updated 4 months ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine. ☆3,699 · Updated 3 months ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆5,569 · Updated 8 months ago
- Official repo for consistency models. ☆6,152 · Updated 7 months ago
- Official PyTorch repo for JoJoGAN: One Shot Face Stylization ☆1,418 · Updated 2 years ago