THUDM / CogView
Text-to-image generation. The repo for the NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,791 Updated last year
Alternatives and similar repositories for CogView
Users who are interested in CogView compare it to the repositories listed below.
- official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers" ☆953 Updated 2 years ago
- Taming Transformers for High-Resolution Image Synthesis ☆6,220 Updated 10 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,424 Updated 2 years ago
- Official repo for consistency models. ☆6,360 Updated last year
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence L… ☆2,504 Updated last year
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,333 Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch ☆5,614 Updated last year
- Official implementation of VQ-Diffusion ☆945 Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆549 Updated 2 years ago
- Easily compute clip embeddings and build a clip retrieval system with them ☆2,574 Updated last year
- SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants. ☆1,081 Updated 6 months ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ☆5,336 Updated 10 months ago
- Deep Learning Examples ☆824 Updated 8 months ago
- A unified 3D Transformer Pipeline for visual synthesis ☆2,808 Updated 2 years ago
- ☆1,181 Updated 2 years ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch ☆775 Updated 10 months ago
- ☆1,703 Updated 8 months ago
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral) ☆891 Updated 2 years ago
- Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744 ☆925 Updated 10 months ago
- GLIDE: a diffusion-based text-conditional image synthesis model ☆3,641 Updated last year
- GLM (General Language Model) ☆3,240 Updated last year
- The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22) ☆903 Updated last year
- PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations ☆1,102 Updated 2 years ago
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing" ☆1,149 Updated last year
- Official PyTorch implementation of StyleGAN3 ☆6,723 Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion ☆1,284 Updated 11 months ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,248 Updated 2 years ago
- Official PyTorch repo for JoJoGAN: One Shot Face Stylization ☆1,429 Updated 2 years ago
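- Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended as infrastructure for Chinese AIGC and cognitive intelligence. ☆4,131 Updated 10 months ago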
- Just playing with getting CLIP Guided Diffusion running locally, rather than having to use colab. ☆387 Updated 2 years ago