THUDM / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,770Updated last year
Alternatives and similar repositories for CogView:
Users that are interested in CogView are comparing it to the libraries listed below
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"☆949Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆546Updated 2 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,521Updated 11 months ago
- ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab☆2,039Updated last year
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation☆5,121Updated 7 months ago
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023☆1,327Updated last year
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆3,966Updated 7 months ago
- Taming Transformers for High-Resolution Image Synthesis☆6,081Updated 7 months ago
- Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)☆4,067Updated last year
- (ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.☆2,299Updated last month
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L…☆2,487Updated 11 months ago
- Sketch Your Own GAN: Customizing a GAN model with hand-drawn sketches.☆714Updated last year
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)☆846Updated last year
- Pretrained Dalle2 from laion☆501Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆7,262Updated last year
- Deep Learning Examples☆820Updated 5 months ago
- Official PyTorch repo for JoJoGAN: One Shot Face Stylization☆1,424Updated 2 years ago
- ☆2,971Updated 2 years ago
- ☆3,261Updated 10 months ago
- [ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列☆1,054Updated 9 months ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆708Updated last year
- CLIP+MLP Aesthetic Score Predictor☆1,020Updated 8 months ago
- Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch☆8,214Updated 5 months ago
- Official PyTorch implementation of StyleGAN3☆6,620Updated last year
- [TOG 2022] SofGAN: A Portrait Image Generator with Dynamic Styling☆772Updated last year
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,235Updated 2 years ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,257Updated 8 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,405Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,609Updated last year
- Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP☆468Updated 3 years ago