Mountchicken / Structured_Dreambooth_LoRA
Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.
☆12 Updated last year
Related projects
Alternatives and complementary repositories for Structured_Dreambooth_LoRA
- ECCV2024_Parrot Captions Teach CLIP to Spot Text ☆60 Updated 2 months ago
- Official implementation of High Fidelity Scene Text Synthesis. ☆36 Updated 2 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20… ☆42 Updated 4 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT ☆132 Updated 6 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023 ☆23 Updated last year
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching ☆20 Updated 7 months ago
- TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control ☆18 Updated last week
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023). ☆121 Updated 3 weeks ago
- Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want ☆60 Updated 3 weeks ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024) ☆41 Updated last month
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024. ☆28 Updated 2 months ago
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing" ☆47 Updated last month
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens ☆53 Updated 3 weeks ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance" ☆33 Updated 4 months ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer ☆72 Updated 7 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆100 Updated 4 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 3 months ago
- ICCV2023-Diffusion-Papers ☆110 Updated last year
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆74 Updated 6 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception ☆116 Updated last month
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆109 Updated 2 months ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023 ☆68 Updated 4 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild ☆26 Updated 6 months ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing" ☆82 Updated 2 months ago
- Official code for paper: Desigen: A Pipeline for Controllable Design Template Generation [CVPR'24] ☆63 Updated 3 months ago