THUDM / CogView2
Official code repository for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
☆943 Updated 2 years ago
Related projects:
- Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023 ☆1,309 Updated last year
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆540 Updated last year
- Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers". ☆1,682 Updated 11 months ago
- Pretrained Dalle2 from laion ☆499 Updated last year
- Official implementation of VQ-Diffusion ☆877 Updated 5 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch ☆740 Updated last month
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch ☆521 Updated 9 months ago
- Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch ☆1,224 Updated 4 months ago
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch ☆1,898 Updated 4 months ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch ☆848 Updated 6 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,355 Updated last year
- CLIP+MLP Aesthetic Score Predictor ☆861 Updated 2 months ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,192 Updated last year
- Deep Learning Examples ☆804 Updated 7 months ago
- Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion ☆1,278 Updated last year
- StyleGAN-Human: A Data-Centric Odyssey of Human Generation ☆1,136 Updated 5 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,534 Updated 8 months ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2 ☆737 Updated 11 months ago
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023] ☆1,057 Updated 2 weeks ago
- [ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis ☆1,144 Updated last year
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral) ☆877 Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them ☆2,355 Updated 5 months ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine. ☆3,599 Updated last month
- A concise but complete implementation of CLIP with various experimental improvements from recent papers ☆677 Updated 11 months ago
- v objective diffusion inference code for PyTorch. ☆711 Updated last year