gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,113 · Updated last year
Alternatives and similar repositories for GLIGEN:
Users who are interested in GLIGEN are comparing it to the repositories listed below.
- T2I-Adapter ☆3,669 · Updated 10 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" … ☆1,029 · Updated last year
- [CVPR2024, Highlight] Official code for DragDiffusion ☆1,215 · Updated last year
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive… ☆1,738 · Updated last year
- Consistency Distilled Diff VAE ☆2,184 · Updated last year
- ICLR 2024 (Spotlight) ☆767 · Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language ☆1,313 · Updated last year
- Paint by Example: Exemplar-based Image Editing with Diffusion Models ☆1,181 · Updated last year
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation ☆1,384 · Updated 3 months ago
- Transfer the ControlNet with any basemodel in diffusers🔥 ☆827 · Updated 2 years ago
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts". ☆1,225 · Updated last year
- Open-source and strong foundation image recognition models. ☆3,216 · Updated 2 months ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral) ☆1,843 · Updated 4 months ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention ☆700 · Updated 3 months ago
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop) ☆2,432 · Updated last year
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity" ☆2,604 · Updated 9 months ago
- PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation ☆5,224 · Updated 9 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis ☆3,070 · Updated 6 months ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆823 · Updated last year
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion" ☆1,413 · Updated last year
- Unified Controllable Visual Generation Model ☆643 · Updated 3 months ago
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once" ☆4,576 · Updated 8 months ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs ☆1,895 · Updated 3 months ago
- [AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using … ☆1,315 · Updated last year
- Speed up Stable Diffusion with this one simple trick! ☆1,342 · Updated last year
- [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing ☆1,428 · Updated last year
- InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024) ☆1,302 · Updated 11 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions" ☆1,559 · Updated last year
- CLIP+MLP Aesthetic Score Predictor ☆1,071 · Updated 10 months ago
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM) ☆3,382 · Updated 2 months ago