JarrentWu1031 / SingleInsert
Official PyTorch implementation for SingleInsert
☆27 Updated last year
Alternatives and similar repositories for SingleInsert
Users who are interested in SingleInsert are comparing it to the repositories listed below
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation ☆41 Updated last year
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral) ☆46 Updated 2 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆64 Updated last week
- ☆28 Updated 3 months ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance ☆25 Updated 6 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated last year
- ☆28 Updated 3 months ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling" ☆27 Updated 3 months ago
- ☆26 Updated 3 months ago
- ☆25 Updated 10 months ago
- ☆11 Updated last year
- Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models ☆28 Updated 9 months ago
- ☆14 Updated last year
- ☆40 Updated last week
- ☆43 Updated 8 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models" ☆70 Updated 5 months ago
- ☆20 Updated 9 months ago
- ☆64 Updated 2 years ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆50 Updated last year
- ☆20 Updated 11 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆15 Updated 9 months ago
- Video Diffusion Transformers are In-Context Learners ☆23 Updated 5 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024) ☆37 Updated 9 months ago
- ☆24 Updated last year
- ☆21 Updated last year
- Code for paper "Compositional Text-to-Image Synthesis with Attention Map Control of Diffusion Models" ☆42 Updated last year
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer" ☆53 Updated 2 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆28 Updated 2 weeks ago
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing ☆28 Updated last year
- Balanced Image Stylization with Style Matching Score ☆29 Updated 2 months ago