CasualGANPapers/Make-A-Scene

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

☆335

Alternatives and similar repositories for Make-A-Scene

Users who are interested in Make-A-Scene are comparing it to the repositories listed below.

omriav / blended-diffusion
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
☆589 · Jun 4, 2024 · Updated 2 years ago
self-distilled-stylegan / self-distilled-internet-photos
☆242 · Apr 5, 2022 · Updated 4 years ago
autonomousvision / stylegan-xl
[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
☆995 · Jun 24, 2024 · Updated 2 years ago
lucidrains / nuwa-pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
☆548 · Jan 17, 2023 · Updated 3 years ago
microsoft / VQ-Diffusion
Official implementation of VQ-Diffusion
☆981 · Apr 17, 2024 · Updated 2 years ago
gnobitab / FuseDream
☆194 · Dec 7, 2021 · Updated 4 years ago
pschaldenbrand / StyleCLIPDraw
Styled text-to-drawing synthesis method. Featured at IJCAI 2022 and the 2021 NeurIPS Workshop on Machine Learning for Creativity and Desi…
☆280 · Nov 15, 2022 · Updated 3 years ago
lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆537 · Dec 8, 2023 · Updated 2 years ago
rinongal / StyleGAN-nada
☆1,193 · Sep 29, 2022 · Updated 3 years ago
multimodalart / majesty-diffusion
Majesty Diffusion by @Dango233 (@Dango233max) and @apolinario (@multimodalart)
☆275 · Jul 25, 2022 · Updated 4 years ago
Jack000 / glid-3-xl
1.4B latent diffusion model fine tuning
☆265 · May 16, 2022 · Updated 4 years ago
drboog / Lafite
Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)
☆184 · Mar 23, 2023 · Updated 3 years ago
cientgu / VQ-Diffusion
☆487 · Jun 30, 2022 · Updated 4 years ago
zai-org / CogView2
Official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
☆955 · Aug 3, 2022 · Updated 3 years ago
crowsonkb / v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
☆719 · Nov 29, 2022 · Updated 3 years ago
energy-based-model / Compositional-Visual-Generation-with-Composable-Diffusion-Models-PyTorch
[ECCV 2022] Compositional Generation using Diffusion Models
☆489 · Apr 24, 2025 · Updated last year
dmarx / Multi-Modal-Comparators
Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP
☆39 · Nov 26, 2022 · Updated 3 years ago
Jack000 / glid-3
Combination of OpenAI GLIDE and Latent Diffusion
☆136 · Apr 7, 2022 · Updated 4 years ago
rinongal / textual_inversion
☆3,055 · Feb 27, 2023 · Updated 3 years ago
openai / glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
☆3,685 · Mar 8, 2024 · Updated 2 years ago
CasualGANPapers / StyleGANXL-CLIP
A notebook for text-based guided image generation using StyleGANXL and CLIP.
☆58 · May 19, 2023 · Updated 3 years ago
google-research / parti
☆1,590 · Jun 28, 2022 · Updated 4 years ago
yuval-alaluf / hyperstyle
Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" (CVPR 2022) https://arxiv.org/abs/…
☆1,027 · Sep 17, 2022 · Updated 3 years ago
PITI-Synthesis / PITI
PITI: Pretraining is All You Need for Image-to-Image Translation
☆502 · Jun 2, 2024 · Updated 2 years ago
zai-org / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,798 · Sep 25, 2023 · Updated 2 years ago
LAION-AI / notebooks
A collection of generative and training notebooks getting mirrored to Google Colab.
☆12 · May 29, 2022 · Updated 4 years ago
JD-P / cloob-latent-diffusion
CLOOB Conditioned Latent Diffusion training and inference code
☆113 · Apr 15, 2022 · Updated 4 years ago
SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,334 · Aug 10, 2023 · Updated 2 years ago
omerbt / Text2LIVE
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
☆888 · Mar 9, 2023 · Updated 3 years ago
afiaka87 / clip-guided-diffusion
A CLI tool/Python module for generating images from text using guided diffusion and CLIP from OpenAI.
☆459 · Dec 31, 2025 · Updated 6 months ago
mehdidc / feed_forward_vqgan_clip
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
☆140 · Jan 3, 2024 · Updated 2 years ago
gwang-kim / DiffusionCLIP
[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
☆867 · Mar 27, 2023 · Updated 3 years ago
dome272 / Paella
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
☆748 · Oct 4, 2023 · Updated 2 years ago
google / prompt-to-prompt
☆3,456 · May 14, 2024 · Updated 2 years ago
NVlabs / denoising-diffusion-gan
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
☆759 · Dec 2, 2022 · Updated 3 years ago
LAION-AI / ldm-finetune
Home of `erlich` and `ongo`. Finetune latent-diffusion/glid-3-xl text2image on your own data.
☆182 · Aug 5, 2022 · Updated 3 years ago
zhangqianhui / 3DSGAN
The code of '3D-Aware Semantic-Guided Generative Model for Human Synthesis' (ECCV 2022)
☆36 · Jul 18, 2022 · Updated 4 years ago
adobe-research / sam_inversion
[CVPR 2022] GAN inversion and editing with spatially-adaptive multiple latent layers
☆174 · Jan 21, 2023 · Updated 3 years ago
betterze / StyleAlign
☆152 · Sep 28, 2022 · Updated 3 years ago