HKUST-LongGroup / CLIPDrag
[ICLR2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.
☆15 Updated last month
Alternatives and similar repositories for CLIPDrag
Users who are interested in CLIPDrag are comparing it to the repositories listed below.
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing ☆24 Updated 5 months ago
- This is a repository to collect training-free algorithms for visual generation and manipulation ☆55 Updated this week
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing ☆78 Updated 6 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ☆32 Updated 2 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ☆50 Updated 2 months ago
- ☆23 Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆154 Updated 4 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆119 Updated 4 months ago
- My implement of InstantBooth ☆11 Updated last year
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func… ☆24 Updated 6 months ago
- ☆43 Updated last month
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆120 Updated 6 months ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following ☆30 Updated 4 months ago
- ☆21 Updated 4 months ago
- A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space ☆81 Updated last year
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆103 Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization ☆61 Updated last year
- Official Implementation of VideoDPO ☆105 Updated this week
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆71 Updated last year
- ☆33 Updated 7 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆30 Updated 2 months ago
- ☆13 Updated 4 months ago
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆24 Updated last year
- Decoupled Textual Embeddings for Customized Image Generation (AAAI 2024) ☆26 Updated last year
- ☆23 Updated last month
- Frequency Autoregressive Image Generation with Continuous Tokens ☆76 Updated last week
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M… ☆14 Updated 2 weeks ago
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance” ☆35 Updated last year
- ☆16 Updated 9 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025) ☆49 Updated 4 months ago