gligen / GLIGENLinks
Open-Set Grounded Text-to-Image Generation
☆2,191Updated last year
Alternatives and similar repositories for GLIGEN
Users that are interested in GLIGEN are comparing it to the libraries listed below
Sorting:
- [CVPR2024, Highlight] Official code for DragDiffusion☆1,248Updated 2 years ago
- T2I-Adapter☆3,788Updated last year
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,243Updated 2 years ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,894Updated last year
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,559Updated 2 years ago
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".☆1,314Updated 2 years ago
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive…☆1,773Updated 2 years ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,056Updated 2 years ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆950Updated 2 years ago
- Consistency Distilled Diff VAE☆2,206Updated 2 years ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,269Updated last year
- Transfer the ControlNet with any basemodel in diffusers🔥☆846Updated 2 years ago
- CLIP+MLP Aesthetic Score Predictor☆1,251Updated last year
- Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)☆889Updated 2 years ago
- Unified Controllable Visual Generation Model☆657Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,843Updated last year
- ☆3,435Updated last year
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,159Updated 2 years ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆757Updated 2 years ago
- [AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using …☆1,355Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,342Updated 2 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,626Updated 3 months ago
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"☆1,010Updated 2 years ago
- Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs☆1,937Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆712Updated last year
- Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)☆3,429Updated 11 months ago
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆620Updated last year
- ☆1,018Updated 2 years ago
- General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX☆1,841Updated 2 years ago
- [NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models☆666Updated last year