gligen / GLIGEN
Open-Set Grounded Text-to-Image Generation
☆2,106Updated last year
Alternatives and similar repositories for GLIGEN:
Users that are interested in GLIGEN are comparing it to the libraries listed below
- T2I-Adapter☆3,652Updated 9 months ago
- [CVPR2024, Highlight] Official code for DragDiffusion☆1,208Updated last year
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,838Updated 3 months ago
- Consistency Distilled Diff VAE☆2,175Updated last year
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".☆1,213Updated last year
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,041Updated 5 months ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,027Updated last year
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"☆823Updated last year
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,171Updated last year
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)☆2,415Updated last year
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"☆998Updated last year
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,888Updated 3 months ago
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,375Updated last month
- Open-source and strong foundation image recognition models.☆3,168Updated last month
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,574Updated 8 months ago
- Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR …☆1,652Updated 2 months ago
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive…☆1,734Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,559Updated last year
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,145Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,790Updated 2 months ago
- Transfer the ControlNet with any basemodel in diffusers🔥☆827Updated last year
- ☆3,274Updated 11 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆930Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆747Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆590Updated 8 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,836Updated 9 months ago
- Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).☆2,227Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,308Updated last year
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,308Updated 3 months ago
- Segment Anything in High Quality [NeurIPS 2023]☆3,873Updated 4 months ago