hananshafi / llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
★71 · Updated 8 months ago
Alternatives and similar repositories for llmblueprint:
Users who are interested in llmblueprint are comparing it to the libraries listed below.
- ★118 · Updated 6 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)" ★164 · Updated 9 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models ★60 · Updated 3 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ★45 · Updated 3 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ★61 · Updated 8 months ago
- Training code for CLIP-FlanT5 ★22 · Updated 6 months ago
- A Large Multimodal Model for Pixel-Level Visual Grounding in Videos ★39 · Updated last month
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ★54 · Updated last year
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ★63 · Updated 4 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ★90 · Updated 10 months ago
- (arXiv.2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives ★32 · Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ★83 · Updated 6 months ago
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten… ★35 · Updated last month
- ★75 · Updated 2 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ★27 · Updated last month
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ★116 · Updated last week
- [CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ★90 · Updated 9 months ago
- Official repo for StableLLAVA ★94 · Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ★118 · Updated last month
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ★44 · Updated last year
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ★81 · Updated last month
- [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos ★95 · Updated last month
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023) ★96 · Updated 8 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ★41 · Updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ★59 · Updated 2 months ago
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions ★122 · Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ★78 · Updated 9 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis ★55 · Updated 2 months ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ★61 · Updated 8 months ago
- ★57 · Updated last year