DLYuanGod / ArtGPT-4
Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4
☆24Updated last year
Related projects: ⓘ
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆17Updated 7 months ago
- ☆42Updated 2 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆98Updated 4 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)☆74Updated 4 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆76Updated 5 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆69Updated 5 months ago
- Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"☆43Updated 5 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆56Updated 2 months ago
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆93Updated last month
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆42Updated last week
- ☆78Updated 3 weeks ago
- ☆34Updated 5 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆46Updated 5 months ago
- ☆58Updated 10 months ago
- ☆37Updated 2 months ago
- ☆93Updated last year
- EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆109Updated 2 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆36Updated last month
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 4 months ago
- The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"☆25Updated 3 weeks ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆93Updated 4 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆48Updated 3 months ago
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Texual Inversion, LoRA, Custom Diffusion…☆73Updated 3 weeks ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated last week
- ☆80Updated last year
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆103Updated 3 months ago
- PartCraft: Crafting Creative Objects by Parts (ECCV2024)☆75Updated last week
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆17Updated 9 months ago
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions☆125Updated last month