Mountchicken / Structured_Dreambooth_LoRA
Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗 Diffusers.
☆15 Updated 2 years ago
Alternatives and similar repositories for Structured_Dreambooth_LoRA
Users who are interested in Structured_Dreambooth_LoRA are comparing it to the libraries listed below.
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆66 Updated last year
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis. ☆77 Updated 8 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild ☆32 Updated last year
- ☆99 Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT ☆136 Updated last year
- ☆99 Updated last year
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images? ☆34 Updated 6 months ago
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25) ☆35 Updated last month
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023). ☆143 Updated 7 months ago
- Text-To-Image Generation with Chinese Characters ☆130 Updated 2 years ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching ☆27 Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86 Updated last year
- [NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation" ☆237 Updated last year
- Code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆44 Updated 2 years ago
- A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation ☆84 Updated last month
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆73 Updated last year
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024 ☆79 Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆122 Updated 5 months ago
- Continuous diffusion for layout generation ☆51 Updated 9 months ago
- ☆17 Updated 4 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ☆33 Updated 7 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting. ☆70 Updated last year
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023 ☆129 Updated 2 years ago
- [2024-NeurIPS] TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control ☆96 Updated 8 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆35 Updated 6 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition` ☆17 Updated 2 years ago
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆46 Updated last year
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024… ☆56 Updated last year
- Unified layout planning and image generation, ICCV2025 ☆34 Updated 7 months ago
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting ☆44 Updated 7 months ago