gligen / GLIGENLinks
Open-Set Grounded Text-to-Image Generation
☆2,169Updated last year
Alternatives and similar repositories for GLIGEN
Users that are interested in GLIGEN are comparing it to the libraries listed below
Sorting:
- [CVPR2024, Highlight] Official code for DragDiffusion☆1,239Updated last year
- T2I-Adapter☆3,758Updated last year
- Paint by Example: Exemplar-based Image Editing with Diffusion Models☆1,227Updated last year
- Consistency Distilled Diff VAE☆2,201Updated 2 years ago
- Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" …☆1,046Updated 2 years ago
- FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)☆1,888Updated 10 months ago
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,559Updated last year
- This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".☆1,293Updated last year
- Transfer the ControlNet with any basemodel in diffusers🔥☆843Updated 2 years ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆3,220Updated last year
- Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with dive…☆1,764Updated 2 years ago
- [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation☆1,561Updated 2 weeks ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,445Updated 2 years ago
- CLIP+MLP Aesthetic Score Predictor☆1,204Updated last year
- [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)☆1,828Updated 9 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability☆947Updated 2 years ago
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆710Updated 10 months ago
- ☆995Updated 2 years ago
- [AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using …☆1,347Updated last year
- Image to prompt with BLIP and CLIP☆2,914Updated last year
- Unified Controllable Visual Generation Model☆651Updated 9 months ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,336Updated 2 years ago
- ICLR 2024 (Spotlight)☆776Updated last year
- ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information☆616Updated last year
- Official Code for MotionCtrl [SIGGRAPH 2024]☆1,465Updated 8 months ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,761Updated 4 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆756Updated last year
- The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥☆821Updated last year
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"☆1,153Updated 2 years ago
- ☆3,406Updated last year