gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,007Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for GLIGEN
- T2I-Adapter☆3,465Updated 4 months ago
- Consistency Distilled Diff VAE☆2,135Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆995Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,159Updated last month
- Transfer the ControlNet with any basemodel in diffusers🔥☆811Updated last year
- [CVPR2024, Highlight] Official code for DragDiffusion☆1,162Updated 9 months ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,741Updated last month
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆898Updated 11 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆731Updated 11 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,537Updated 10 months ago
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".☆1,124Updated 10 months ago
- CLIP+MLP Aesthetic Score Predictor☆898Updated 4 months ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆566Updated 3 months ago
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"☆971Updated last year
- Open-source and strong foundation image recognition models.☆2,860Updated 3 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,784Updated last week
- Speed up Stable Diffusion with this one simple trick!☆1,285Updated 11 months ago
- ICLR 2024 (Spotlight)☆723Updated 8 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,343Updated 3 months ago
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive…☆1,675Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,689Updated 3 weeks ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,288Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,231Updated 4 months ago
- ☆3,123Updated 5 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,824Updated 4 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,395Updated last month
- A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.☆3,401Updated this week
- Latte: Latent Diffusion Transformer for Video Generation.☆1,698Updated last month
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,318Updated 8 months ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆603Updated 3 months ago