gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,016 Updated 8 months ago
Related projects
Alternatives and complementary repositories for GLIGEN
- T2I-Adapter ☆3,482 Updated 4 months ago
- Consistency Distilled Diff VAE ☆2,137 Updated last year
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆999 Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,542 Updated 10 months ago
- Paint by Example: Exemplar-based Image Editing with Diffusion Models ☆1,109 Updated 11 months ago
- [CVPR2024, Highlight] Official code for DragDiffusion ☆1,165 Updated 9 months ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation ☆1,177 Updated last month
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆2,809 Updated 3 weeks ago
- Transfer the ControlNet with any basemodel in diffusers🔥 ☆813 Updated last year
- ICLR 2024 (Spotlight) ☆726 Updated 8 months ago
- Speed up Stable Diffusion with this one simple trick! ☆1,287 Updated 11 months ago
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM) ☆3,334 Updated 8 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt. ☆5,283 Updated 4 months ago
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) ☆1,693 Updated last month
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information ☆568 Updated 3 months ago
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion" ☆976 Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability ☆901 Updated last year
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral) ☆1,757 Updated last month
- Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023) ☆2,716 Updated 11 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity" ☆2,362 Updated 4 months ago
- ☆3,137 Updated 6 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs ☆1,830 Updated 4 months ago
- CLIP+MLP Aesthetic Score Predictor ☆904 Updated 4 months ago
- Transparent Image Layer Diffusion using Latent Transparency ☆2,023 Updated 5 months ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆784 Updated last year
- Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model. ☆1,766 Updated 8 months ago
- A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc. ☆3,452 Updated this week
- Official Code for MotionCtrl [SIGGRAPH 2024] ☆1,324 Updated 2 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,374 Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language ☆1,289 Updated last year