BurakCanBiner / SonicDiffusion
☆17 Updated 4 months ago
Related projects:
- [CVPR 2024] Official PyTorch implementation of "Contrastive Denoising Score (CDS) for Text-guided Latent Diffusion Image Editing" ☆82 Updated 5 months ago
- ☆25 Updated 2 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners ☆113 Updated 2 months ago
- [CVPR 2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆77 Updated 5 months ago
- Website source files for Diffusion2GAN Project. ☆76 Updated last week
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023 ☆37 Updated last year
- ☆75 Updated last year
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI 2024) ☆75 Updated 6 months ago
- ☆30 Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆89 Updated 3 weeks ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆62 Updated 7 months ago
- ☆89 Updated 9 months ago
- Official implementation of the CVPR 2024 paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models". ☆47 Updated 6 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆91 Updated 2 months ago
- Training-Free Condition-Guided Text-to-Video Generation ☆53 Updated 8 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper ☆117 Updated 4 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆84 Updated 6 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023) ☆85 Updated 4 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024) ☆127 Updated 3 months ago
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models ☆76 Updated 2 weeks ago
- ☆23 Updated 4 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆107 Updated 2 months ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path ☆65 Updated last year
- Compositional Inversion for Stable Diffusion Models (AAAI 2024) ☆33 Updated 5 months ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization ☆27 Updated 3 months ago
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation ☆15 Updated 2 weeks ago
- The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" ☆85 Updated last month
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation ☆38 Updated 9 months ago
- A novel method that provides greater control over generated images by guiding the internal representations of the pre-trained Stable Diff… ☆31 Updated 7 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models ☆49 Updated last week