zyang-ur / idea2img
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024
β20Updated last year
Alternatives and similar repositories for idea2img:
Users that are interested in idea2img are comparing it to the libraries listed below
- ποΈ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"β104Updated 9 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Modelβ42Updated 6 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Modelβ45Updated 5 months ago
- β39Updated last year
- [ECCV2024] PartCraft: Crafting Creative Objects by Partsβ87Updated last month
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"β66Updated 2 months ago
- β47Updated 2 months ago
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editingβ23Updated 2 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.β46Updated 4 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Modelsβ51Updated 6 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"β62Updated 10 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"β108Updated 4 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β40Updated 6 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Methodβ26Updated 10 months ago
- β40Updated 7 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generationβ29Updated 3 months ago
- β61Updated 2 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"β44Updated 2 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"β95Updated last year
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)β20Updated this week
- T2VScore: Towards A Better Metric for Text-to-Video Generationβ78Updated 10 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"β46Updated 4 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Attenβ¦β36Updated last week
- [NeurIPS 2024] Official Implementation of Attention Interpolation of Text-to-Image Diffusionβ93Updated 3 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editingβ52Updated 10 months ago
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Modelsβ27Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesisβ83Updated 7 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"β67Updated 8 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generationβ100Updated 7 months ago