google / break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
☆521Updated last year
Alternatives and similar repositories for break-a-scene:
Users that are interested in break-a-scene are comparing it to the libraries listed below
- [SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images☆503Updated 5 months ago
- Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]☆598Updated 11 months ago
- PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.☆427Updated 11 months ago
- ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)☆534Updated last year
- LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…☆470Updated 7 months ago
- Official Pytorch Implementation of DenseDiffusion (ICCV 2023)☆493Updated last year
- [CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor☆515Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆700Updated 3 months ago
- [NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".☆351Updated 2 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆420Updated last year
- Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)☆734Updated last year
- Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis☆315Updated last year
- Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"☆539Updated last year
- LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation☆478Updated 5 months ago
- Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"☆367Updated last year
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆297Updated 9 months ago
- ☆501Updated 4 months ago
- Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]☆574Updated 11 months ago
- [ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.☆509Updated last year
- [WACV 2024] Training-Free Layout Control with Cross-Attention Guidance☆256Updated last year
- Zero-shot Image-to-Image Translation [SIGGRAPH 2023]☆1,117Updated 6 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆749Updated last year
- [ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models☆317Updated last year
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models☆325Updated 2 years ago
- [NIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxi…☆272Updated 2 months ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆354Updated last year
- [NeurIPS'23] Emergent Correspondence from Image Diffusion☆685Updated 11 months ago
- ☆465Updated 7 months ago
- [CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"☆330Updated 11 months ago
- DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual A…☆479Updated last month